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A  BAYESIAN  APPROACH  TO  ACOUSTIC  IMAGING  AND  OBJECT 
CLASSIFICATION  BY  HIGH  FREQUENCY  SONAR 

INTRODUCTION 

Simply  stated,  the  objective  of  this  work  is  to  determine  an  optimum  (in  some  sense) 
space-time  signal  processor  for  object  classification  by  a  high  frequency  sonar.  Only  active  sonar 
operating  at  a  frequency  and  range  of  the  order  or  10  kHz  (or  greaterjd  and  several  kilometers, 
respectively,  is  addressed.  Applications  of  such  a  sonar  are  to  underwater  vehicles,  principally 
weapons  and  Unmanned  Undersea  Vehicles  (UUVs).  The  approach  is  to  formulate  the 
classification  problem  as  a  statistical  decision  among  several  alternative  hypotheses,  each 
corresponding  to  a  particular  class  of  true  or  false  target.  (The  null  hypothesis  corresponds  to  only 
noise  in  the  data.)  The  criterion  for  optimality  of  the  test  is  the  Bayes  risk,  which  is  to  be 
minimized.  This  criterion  leads  to  a  test  based  on  the  likelihood  ratio,  the  ratio  of  the  likelihood 
functions  of  the  measured  data.  The  data  consist  of  a  discrete  set  of  time  series  representing 
measurements  of  the  received  pressure  field  at  the  various  positions  of  the  transducer  elements  in 
the  sonar  array.  In  general,  the  measurements  can  be  represented  as  a  vector-valued  stochastic 
processes  x(t),  the  components  corresponding  to  the  measurements  at  the  element  positions  within 
the  receive  array.  The  process  x(t)  is  composed  of  signal  and  noise,  also  vector-valued  processes. 
The  signal's  statistical  properties  vary  among  the  alternative  hypotheses  that  correspond  to  the 
various  classes  of  real  and  false  targets.  As  shown  subsequently,  the  signal  itself  is  modeled  as  a 
randomly  weighted  sum  of  time-delayed  replicas  of  the  waveform  of  the  transmitted  signal.  The 
received  signal  is  then  statistically  characterized  by  a  multivariate  probability  density  function  (pdf) 
of  the  random  weights.  The  pdf  (its  family  or,  more  simply,  its  moments)  differs  for  each  of  the 
various  signal  (i.e.,  target)  classes.  The  principal  results  in  this  report  are  derived  assuming  a 
Gaussian  multivariate  pdf;  however,  the  non-Gaussian  case  as  well  as  more  general  signal  models 
are  discussed. 

In  summary,  the  active  sonar  classification  is  approached  as  a  likelihood  ratio  test  of  multiple, 
alternative  hypotheses  versus  a  noise-only  null  hypothesis.  The  data  are,  in  general,  vector-valued 
stochastic  processes  representing  measurements  from  individual  elements  within  a  sonar  array.  An 
explicit  form  is  assumed  for  the  received  signal  model,  which  is  statistically  characterized  for  each 
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alternative  hypothesis  (target  class).  Explicit  results  are  derived  for  the  likelihood  ratio,  and 
various  performance  characteristics  are  shown.  Moreover,  the  optimal  processor  is  examined  from 
the  perspective  of  acoustic  image  processing.  Generalizations  of  the  results  are  indicated  and  in 
some  cases  addressed  in  detail  (e.g.,  the  case  of  moving  targets). 


2 


CLASSIFICATION  PROBLEM 


The  sonar  detection  problem  is  usually  formulated  as  a  statistical  hypothesis  test  (references 
1  and  2),  and  the  test  is  binary.  Under  the  null  hypothesis  the  observations  or  data  (i.e.,  the 
sensors'  outputs)  consist  of  noise  alone;  under  the  alternative  hypothesis  !H\,  the  observations 
consist  of  noise  and  a  signal  that  is  the  sensors'  response  to  the  presence  of  a  target  or  other 
interesting  object.  The  sonar  classification  problem  can  be  similarly  formulated  except  that  the  test 
is  not  binary.  What  distinguishes  the  classification  problem  is  that  there  are  at  least  two  alternative 
hypotheses  because,  in  addition  to  the  signal  produced  by  the  real  target,  another  signal  can  appear 
in  the  observations.  In  the  case  of  active  sonar,  this  other  signal  results  from  the  sensors’ 
responses  to  the  pressure  field  scattered  from  an  object,  distinct  from  the  real  target;  but  it 
resembles  in  some  significant  way  the  signal  produced  by  the  real  target. 

In  general,  there  may  be  several  of  these  false-target  signals  each  corresponding  to  a  distinct 
class  of  scattering  objects  (either  natural  or  man-made)  and  each,  along  with  additive  noise, 
representing  additional  alternative  hypotheses  regarding  the  constitution  of  the  data.  It  follows  that 
the  classification  problem  can  be  formulated  as  a  multiple  hypothesis  test  with  a  null  hypothesis  ^ 
and  alternative  (exhaustive  and  mutually  exclusive)  hypotheses  where  i  =  1,2 . 1,  and  I  >  1; 

Mq'-  x(t)  =  n(t),  (la) 

x(t)  =  Sj(t)  +  n(t),  i  =  1,2 . 1,  (lb) 


where 

t  denotes  time,  and  Ti  <  t  <  T2,  T[  and  T2  being  the  endpoints  of  the  observation  interval, 
x(t)  =  [xi(t),  X2(t),...,XN(t)]T,  an  N  x  1  complex  vector, 

xn(t)  denotes  the  output  from  nth  sensor  element  (e.g.,  the  nth  transducer  element  in  an 
acoustic  array), 

Sj(t)  =  [sa,(t),  Si2(t),...,  SjN  (t)]T,  an  N  x  1  complex  vector, 

Sjn  (t)  denotes  the  signal  component  in  the  n^  sensor  output, 
n(t)  =  [ni(t),  n2(t),...,  nN(t)F,  an  N  x  1  complex  vector, 
n„(t)  denotes  the  noise  component  in  the  nth  sensor  output, 
n  =  1,2,...,  N,  and  vT  denotes  the  transpose  of  any  vector  v. 

The  vector  x(t)  is  a  stochastic  process  that  is  the  output  (voltage)  of  an  array  of  acoustic 
transducers  and  comprises  the  data.  Let  Si(t)  be  the  signal  component  of  x(t)  corresponding  to  the 


real  target;  Sj(t)  is  the  false  signal  component  in  x(t)  corresponding  to  each  of  the  false  target 
hypotheses  i  =  2,3,— ,1.  The  statistical  hypothesis  test  decides  among  &o,  based  on 

measurement  over  the  interval  [Ti.TJ  of  x(t)  or,  more  precisely,  realizations  of  the  vector-valued 
process  x(t).  Note  that,  in  general,  S;(t)  is  a  stochastic  process,  i  =  1,2,...,I. 

These  processes  are  assumed  to  be  zero-mean,  and  the  signals  and  noise  are  uncorrelated; 
that  is. 


E[n(t)]  =  0,  (2) 

E[Si(t)]  =  0,  i  =  1,2,. ..,1,  (3) 

and  thus 

E[x(t)l^]=0,  i  =  0,1,— ,1,  (4) 

and 

E[n(tj)  sj  (t2)J  =  0,  (5) 


for  all  t,  ti,  t2  in  [T^T^I  (v*  denotes  the  complex  conjugate- transpose  of  any  vector  v).  The 
N  x  N  covariance  matrices  Kn  (ti ,t2)  and  Ksi  (ti,t2)  are  defined  by 

Kn(tl,t2)  =  E[n(t1)nt(t2)],  (6) 

Ksi(t1,t2)  =  E[Si(t1)sf(t2)],  i  =  1,2,...,!,  (7) 


and 

Kx,(ti,t2)  =  E[x(ti)x+(t2)l^],  i  =  0,1,... I, 
for  all  ti,t2  in[Ti,T2].  Then,  from  equations  (1)  and  o  >. 


Kxi(ti,t2)  =  Ksi(ti,t2)  +  Kn(ti,t2),  i  =  1.2.  ,1. 

and 


(8) 


(9) 
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Kxo(ti,t2)  =  Kn(ti,t2), 


for  all  ti,t2  in  [Ti,T2].  Note  that,  from  equation  (8), 
K^.(ti,t2)  =  K*.(t2,ti), 


(10) 


for  all  ti,t2  in[T1?T2]  and  i  =  0,1, 2,..., I. 

For  the  binary  case  (1  =  1),  the  optimal  (Bayesian)  test  for  vector-valued  processes  (N  >  1) 
is  derived  in  references  3  and  4  based  on  the  vector-valued  Karhunen-Loeve  expansion  (reference 
5).  The  optimal  (minimum  probability  of  error)  test  in  the  general  case  of  multiple  alternative 
hypotheses  (I  >  2)  is  given  in  reference  6,  but  is  restricted  to  the  scalar  case  (N  =  1)  and  is  not 
directly  applicable  to  multiple  channels  (sensors).  Multiple  alternatives  and  vector-valued  data  are 
treated  in  reference  7,  but  only  when  the  signal  vector  is  assumed  known  a  priori  for  all  alterna¬ 
tives.  In  a  treatment  of  diversity  reception  (reference  8),  a  similar  case  is  addressed  wherein  a 
multiplicative,  random  disturbance  is  present  along  with  additive  noise;  however,  the  multiplicative 
noise  is  identical  under  all  alternative  hypotheses.  Hence,  such  a  model  does  not  apply  to  the 
classification  problem  as  formulated  here.  When  the  signal  and  noise  processes  are  stationary, 
then,  of  course,  the  covariance  matrix,  of  the  data  x(t)  can  be  written  as  Kx^ti.t^  =  Rx,(ti  -  fz), 
for  all  ti,t2  in  [Ti,T2]  and  i  =  0,1, 2,.. .,1.  In  this  case  the  analysis  proceeds  using  the  spectral 
density  matrix  which  exists  as  the  Fourier  transform  of  Rxi  (tj  - 12).  This  is  well-covered  in  the 
literature;  references  9  -  13  are  representative.  The  signals  and  noise  received  by  passive  sonar 
can  usually  be  assumed  to  be  stationary.  However,  when  active  sonar  is  used,  the  signals  and 
noise  (reverberation)  are  inherently  nonstationary;  they  are  received  as  the  pressure  field 
backscattered  from  complex  objects  (i.e.,  objects  with  nonuniform  scattering  strength).  Therefore, 
it  is  the  inherent  nonstationarity  of  the  signals  that  is  the  basis  for  the  discrimination.  To  assume 
stationarity  would  be  to  overlook  a  characteristic  feature  of  the  signal  that  reveals  the  spatial 
character  of  a  scattering  object. 

In  addition  to  reference  6  cited  previously,  there  are  theoretical  treatments  of  the  general 
classification  problem,  references  14  and  15,  for  example.  While  these  address  the  case  of 
I  >  2,  they  typically  consider  only  the  case  of  a  scalar  process  (N  =  1).  Moreover,  they  do  not 
introduce  models  of  the  signal  processes  (other  than  the  Gaussian  assumption).  Signals  must  be 
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modeled  to  infiase  into  the  classifier  their  distinguishing  physical  characteristics.  In  the  next 
section,  a  simple  but  general  model  of  the  signal  processes  is  introduced;  it  applies  to  active  sonar. 

In  summary,  the  unique  contribution  presented  herein  is  the  systematic  integration  of  all  of 
the  essential  features  of  the  active  sonar  classification  process:  (1)  multiple  alternative  hypotheses 
(I  >  2);  (2)  multiple  channels  (sensors)  represented  by  vector-valued  processes  (N  >  2); 

(3)  a  stochastic,  but  not  necessarily  stationary,  model  of  the  alternative  signal  processes  that 
applies  to  the  active  sonar  problem;  (4)  an  optimal  (Bayesian)  decision  rule  where  the  costs  of 
errors  (incorrect  decisions)  are  not  necessarily  identical.  Moreover,  a  concise  expression  for  the 
optimal  processor  is  derived. 
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SIGNAL  MODEL 


The  i^  signal  vector  is,  as  in  equation  (I),  given  by 
Si(t)  =  [Sii(t),si2(t),...,siN(t)F,  i  =  1,2 . 1. 

Let  the  n^1  component  of  Si(t)  be  represented  as  a  sum  of  time-delayed  and  amplitude-weighted 
replicas  of  the  transmitted  waveform  f(t);  i.e.,  let,  for  n  =  1,2,...,N, 

Ki 

Sin(0  =  ^  ^ikf(t  —  ^ink)>  Tj  <  t  <  T2,  (1 1) 

k=l 


K 

where,  for  each  i  =  1,2,... I,  {a*}  1 1  is  a  sequence  of  complex  random  variables  with  the  following 
properties: 


EM  =  0, 


(12) 


and 

E(^jaki)  =  [**]/*. 


(13) 


where  j,  k,  1  =  l,2,...,Ki;  Kai  is  the  (Ki  x  K,)  covariance  matrix  of  the  random  variables 

K' 

{3*}/,!=  1,2,..., I;  and  8ik  is  the  Kronecker  delta  function. 

If  it  is  assumed  that  a  given  object  —  the  i^1  one  —  is  a  set  of  K,  distinct  stationary 
scatterers,  then  the  signal  model,  equation  (1 1),  is  interpreted  as  follows.  The  index  k  is  over  the 
set  of  scatterers:  each  value  of  k  corresponds  to  one  and  only  one  scatterer,  a*  is  its  scattering 
coefficient;  Tin*  is  the  delay  from  the  instant  (t  =  0)  of  transmission  (from  a  reference  transducer 
element  in  the  array)  to  reception  by  the  element  of  the  array  of  the  waveform  scattered  by  the 
k1^  scatterer.  Under  this  interpretation,  analogous  to  the  so-called  Lagrangian  (or  material) 
description  in  fluid  mechanics,  Kj  and  the  delays  {Tin*}  1  are  not  fixed  at  known  values  within  a 
reference  coordinate  system.  In  general,  they  would  be  treated  as  random  variables  (at  least  for 
naturally  occurring  false  targets);  thus,  specification  of  their  joint  distribution  function  is  required. 
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Moreover,  K;  must,  in  general,  be  assumed  to  be  a  random  variable  having  a  discrete  distribution 
such  as  the  Poisson,  for  example.  This  random  model  obtains  from  the  empirical  observation  that 
natural  objects  that  appear  as  false  targets  are,  at  sufficiently  high  frequencies,  ensembles  of 
reflecting  facets  whose  spatial  distributions  are  random.  Since  determination  of  realistic  probability 
distribution  functions  of  the  appropriate  random  variables  is  difficult,  the  Lagrangian  description  is 
problematic  and  will  not  be  further  considered. 

A  more  suitable  alternative  is  the  Eulerian  (or  spatial)  description.  Following  this 
description,  a^  is  interpreted  as  the  random  scattering  coefficient  corresponding  to  a  specific 
volume  element  fixed  within  a  given  (three-dimensional)  coordinate  system.  The  origin  of  the 
coordinate  system  is  conveniently  fixed  at  some  reference  point,  a  transducer  element  in  the 
receiving  array,  for  example,  as  in  figure  1.  The  figure  illustrates  an  example  with  spherical 
coordinates  and  a  uniform  planar  array  in  a  nonreffactive  medium.  1  he  set  of  volume  elements 
defines  a  region  in  the  medium,  a  so-called  test  region,  for  which  one  and  only  one  of  {H}q  is 
true.  More  precisely,  the  test  region  is  defined  as  a  set  of  disjoint  volume  elements  called  cells  that, 
it  is  assumed,  contain  at  most  one  scatterer.  Otherwise,  the  cell  size  remains,  for  now, 
unspecified.  When  the  test  region  is  occupied  by  the  i1*1  object,  the  scattering  from  the  region  is 
characterized  by  the  set  of  coefficients  {aid  if1,  for  i  =  1,2,...,I.  The  delay  is  now  the  delay 
from  t  =  0  to  reception  at  the  n^1  transducer  element  of  the  waveform  scattered  from  the  cell 
(or,  more  precisely,  that  pan  of  the  i^1  object  that  occupies  the  cell).  It  can  be  assumed  that,  for 
each  H.;\  -  1,2,...  J,  the  test  region  and  its  decomposition  are  specified  a  priori.  Thus,  the 
number  of  cells  Kj  and  the  position  of  each  cell  (and,  hence,  the  values  of  the  delays  {Tu*}  { 1 )  are 
specified  constants  for  all  i  =  1,2,...,I.  Of  course,  as  search  for  the  actual  target  progresses,  the 
test  region  will  shift  in  some  prescribed  way  through  the  insonified  volume  during  each  transmit 
cycle.  The  values  of  {link}  x  1  and  K;  will  change  accordingly. 

By  adopting  the  Eulerian  description,  the  randomness  of  the  signal  is  characterized  in  the 
model  solely  by  the  statistical  properties  of  the  coefficients  {ai]()1  \  which,  to  second  order,  are 
specified  by  the  covariance  matrix  Kai,  i  =  1,2,...,I.  In  what  follows,  Kai  is  not  required  to  be  of 
full  rank;  therefore,  the  model  allows  for  coherence  among  the  waveforms  scattered  from  each  cell. 
In  practice,  the  dimensions  and  decomposition  of  test  regions  and  estimates  of  Kai  will  be 
determined  directly  from  both  experimental  data  and  calculations  from  scattering  models  of  the 
hypothesized  targets.  Of  course,  the  dimensions  of  the  test  regions  would  match,  roughly  at  least, 
the  characteristic  dimensions  of  the  targets. 
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Signal  Model 


To  summarize,  equation  (1 1)  is  rewritten  in  vector  form: 


Si(t)  =  Fi(t)ai,  Ti  £  t  <  T2,  i  =  1,2 . 1,  (14) 

where  Si(t)  is  as  in  equation  (1);  a;  =  [ai1,ai2,...,ajKi]T,  a  Kj  x  1  complex  vector;  Fj(t)  is  an 
NX  K,  complex  matrix  with  elements  given  by  [F^t)],*  =  f(t - Tin*),  n  =  1,2,...,N  and 
k  =  1,2,. ..Ki.  The  following  assumptions  pertain  to  equation  (14): 

1 .  Implicit  in  equation  (14)  is  that  scattering  from  a  given  cell  is  frequency  independent  at 
least  over  the  band  of  f(t);  otherwise,  the  summand  on  the  right  side  of  equation  (14)  would  be  a 
temporal  convolution. 

2.  The  test  region  comprises  K;  disjoint  volume  elements  (cells)  that  contain,  at  most,  one 

scattering  facet.  00 

3 .  f(t),  the  transmit  waveform,  is  known  and  has  finite  energy  J  |f(t)|2dt  =  Ef ; 

— oo 

otherwise,  f(t)  is  arbitrary,  as  consistent  with  assumption  1. 

4.  E(aO  =  0,  i  =  1,2,..., I,  as  previously  indicated. 

5 .  Neither  the  scatterers  nor  the  sonar  platform  is  in  motion  (this  assumption  will  be 
relaxed  later). 

It  follows  from  equation  (14)  that  the  covariance  matrix  of  Si(t)  is  given  by 
Ksi(t1,t2)  =  Fi(ti)KaiFf(t2),  (15) 


and 

E[Si(t)]  =  0,  i  =  1,2,..., I,  (16) 

reiterating  equation  (3).  Furthermore,  from  equation  (15),  the  trace  of  Ksi(t,t)  is  given  by, 
for  i  =  1,2,...,I, 

tr[Ksi(t,t)]=tr[Ff(t)Fi(t)Kai],  (17) 

and,  therefore,  the  total  energy  in  the  received  signal  is  given  by 
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(18) 


Ti 

Jtr[Ksi(t,t)]dt  =  NEftr[OiKai], 

t2 

where  Oi  is  the  positive  definite,  Hermitian  (Kj  x  Kj)  matrix, 

t2 

t|  f  !  ct)Fi<odt,  a?) 


and  is  the  normalized  waveform  correlation  matrix.  If,  for  example,  the  scattering  is  incoherent 
from  cell  to  cell,  then  the  matrix  Kai  is  diagonal  and  equation  (18)  becomes 


Ti 

J  tr[Ksi(t,t)]dt  =  NEftrIK.il,  (20) 

t2 

as  expected. 

Note  that  this  kind  of  signal  model  is  more  flexible  than  it  might  first  appear.  The  matrix  Kai 

can  account  for  coherence  among  the  signals  scattering  from  the  cells  in  the  test  region.  As 

mentioned  above,  incoherent  scattering  is  characterized  by  letting  Kai  be  diagonal  (but  not 

necessarily  of  full  rank,  since  some  diagonal  terms  may  be  identically  zero  when  the  corresponding 

cell  is  empty).  When  coherent  scattering  occurs,  Kai  will  have  off-diagonal  terms  accounting  for 

K’ 

correlations  between  pairs  from  £a,k  ]  1 1  corresponding  usually  to  neighboring  cells. 


NOISE  MODEL 


The  zero-mean  noise  process  n(t)  in  equation  (1)  has  covariance  matrix  Kn(ti,t2)  such  that 

t2 

J  tr(K„(t,t)]dt  <  co.  (21) 

Ti 
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It  is  assumed  that  n(t)  always  contains  a  white  noise  component;  the  colored  noise  component  has 
a  covariance  matrix  with  elements  that  are  uniformly  bounded  and  continuous  for  all  ti,t2»  in 
[Ti,T2]. 

Furthermore,  it  is  assumed  that  for  the  noise  process  n(t)  there  exists  a  reversible  whitening 
filter  with  an  impulse  response  (NxN)  matrix  W(ti,t2);  so  that  if  w(t)  is  a  vector- valued,  white 
noise  process,  then 

t2 

w(t)  =  j  W(t,x)n(x)dx,  Tj  <  t  <  T2  (22) 

Ti 


which  leads  to 

T2  T2 

^S(ti-t2)I  =  J  J  W(t1,x)Kn(x,x')Wt(x,,t2)dxdx/ 

(23) 

for  ti,  t2  in  [Ti.TJ;  NJ2  is  the  spectral  density  level  of  the  white  noise;  I  is  the  N  x  N  identity 
matrix.  Define  y(t)  by 

t2 

y(t)  =  J  W(t,x)x(x)dx,  Tj  <  t  <  T2; 

(24) 

then  the  model  given  by  equations  (1)  and  (14)  is  transformed  to 

^o:  y(t)  =  w(t). 

(25a) 

H'  y(t)  =  Gi(t)a;  +  w(t). 

(25b) 

where  G;(t)  is  the  Nx  K,  matrix  given  by 

t2 

Gi(t)  =  J  W(t,x)Fi(x)dx,  (26) 
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for  i  =  1,2 . 1  and  Ti  <,  t  <T2-  Equations  (25)  are  of  the  same  form  as  equations  (1)  and  (14): 

Gj(t)  replaces  Fj(t),  both  being  the  known,  matrix-valued  functions  that  pre-multiply  the  random 
vector  aj.  Because  W(ti,t2)  is  reversible,  it  can  be  applied  to  x(t),  as  in  equation  (24),  to  facilitate 
the  solution  to  the  optimal  decision  problem,  i.e.,  the  optimal  hypotheses  test  given  the  data,  x(t). 
The  reversibility  of  W(ti,t2)  is  defined  as  the  existence  of  an  inverse  W-l(ti,t2)  such  that 

T2 

/  W-i(t1,t)W(x1,t2)dx  =  5(tl  -  t2)I,  (27) 

Ti 


for  ti,t2  both  in  [Ti,T2].  The  reversibility  of  W(ti,t2)  is  a  sufficient  condition  for  the  equivalence 
of  equations  (25)  and  equations  (1)  and  (14)  for  both  the  decision  problem  and  the  estimation 
problem  (reference  1).  The  estimation  problem  will  be  discussed  briefly  in  the  next  section.  In 
what  follows,  it  is  assumed  that  the  noise  process  is  white  and  that  the  form  given  by  equations  (1) 
and  (14)  holds,  understanding  that  an  identical  form  is  obtained  if  the  noise  is  colored  and  a 
(reversible)  prewhitening  filter  is  applied  as  explained. 


ESTIMATION  OF  aj 


Before  developing  the  optimal  hypothesis  test  (classifier),  it  is  instructive  to  first  determine 
the  optimal  estimator  of  the  random  vector  aj  from  the  data  x(t),  which  under  ^  (i  =  1,2,...,I)  are 
given  by  the  linear  model  (combining  equations  (lb)  and  (14)) 

x(t)  =  Fj(t)aj  +  n(t),  Ti  £  t  S  T2.  (28) 

An  estimator  a,  is  sought  that,  under  H,  is  a  minimum  variance  estimator  of  a;  and  that  is 
unbiased  and  linear  as  follows: 


E(aj|*D  =  E(aj), 


(29) 


and 


t2 

3j=  J  H;(t)x(t)  dt,  i  =  1,2 . 1, 

Tl 


(30) 
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where  Hi(t)  is  a  K;  x  N  matrix  function  to  be  determined.  Note  that  E(aO  =  0  and  E[n(t)]  =  0; 
therefore,  E[x(t)|2£]  =  0;  and,  thus,  E(a,)  =  0,  for  i  =  1,2,..., I.  Let  £;  represent  the  estimation 
error,  ej  =  a,  -  a,,  and  consider  the  quadratic  form  q(a)  given  by 

q(ai)  =  E(efVeil^), 

where  V  is  any  K,  x  K;  positive  definite  matrix  and  a;  is  any  estimator  that  satisfies  equations  (29) 
and  (30).  Then  q(aO  is  minimized  when  a;  =  a;  and  Hi(t)  satisfies  the  special  form  of  the  Wiener- 
Hopf  equation  (references  18  and  19)  given  by 

Ti 

E[aiX+(t)|^]  =  J  Hi(T)E[x(T)x+(t)|:<]dT,  (31) 

t2 

for  i  =  1,2,..., I  and  Ti  £  t  £  T2.  From  equation  (28), 

E[aiXt(tM]  =  KaiF f(t),  i  =  1,2,. ...I,  (32) 

since  Efain^t)]  =  0  for  all  t  in  [Tj,T2].  From  equations  (9)  and  (15), 

E[x(u)xt(t2)l^]  =  Fi(t1)KaiFlt(t2)  +  ^5(ti  -  t2)I,  i  =  1,2 . 1,  (33) 


where,  as  explained  before,  it  has  been  assumed  that 
Kn(ti,t2)  =  ^  8(ti  -  t2)I,  Ti  <,  ti,t2  <  T2. 

Equation  (31)  becomes 
Tl 

KaiF^(t)  =  J  H^F^dx  KaiF,+  (t)  +  ^Hj(t), 


(34) 


for  i  =  1,2,..., I  and  Ti  <,  t  <  T2.  Now,  let  H,(t)  =  HjF^(t),  where  H;  is  a  Kj  x  K;  matrix  to  be 

determined,  and  substitute  into  equation  (34).  This  yields,  for  i  =  1,2 . 1,  the  soludon  for  H; 

given  by 

+  (35) 

where  as  before  Oj  is  the  normalized  waveform  correlation  (Kx  x  K,)  matrix  given  by 

t2 

$1  =  ^  J  Fj(t)F.(t)dL  (36) 

Note  that  equation  (34)  is  a  matrix  form  of  the  Fredholm  integral  equation  of  the  second  kind.  A 
unique  solution  exists  since,  in  this  case,  the  components  of  the  matrix  Ksi(tj,t2)  =  Fi(ti)KaiFjf(t2) 
are  continuous  in  q,  t2  over  [Ti,T2]  and  are  uniformly  bounded.  The  optimal  (under  y£)  estimator 
ai  is  then  given  by  (from  equations  (30)  and  (35)): 

T2 

a^il-Kaitl  +  pNOiKa;]'1  j  F!(t)X(t)dt,  i  =  1,2,..., I,  (37) 


where  pN  = 


2NEf 
No  ‘ 


It  can  be  shown,  using  equation  (37),  that  the  estimation  error  covariance  matrix  is  given  by 

E[(ai-ai)(a-ai)t|^]  =  Kai[l  +  pNcDlKil]  '.  1  =  1.2 . 1.  (38) 

Note  that  this  estimation  error  covariance  is  conditional  on  q  being  true.  Of  course,  the  estimator 
a;  given  by  equation  (37)  applies  only  when  ^  is  true;  its  applicability  will  become  apparent  when 
the  optimal  classifier  is  developed  in  subsequent  secaons. 

Equation  (38)  shows  explicitly  the  dependence  of  the  estimation  error  covariance  on  a  priori 
information  regarding  the  i1^  scattering  object's  spatial  properties.  Moreover,  the  dependence  on 
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system  characteristics  —  the  array  geometry  and  the  transmitted  waveform  —  is  explicitly 
represented  by  the  matrix  O;.  Thus,  the  effect  of  alternative  array  and  waveform  designs  can  be 
evaluated  directly  through  equation  (38).  For  example,  the  performance  of  sparse  (two- 
dimensional)  arrays  of  receive  transducers  can  be  assessed  for  acoustic  imaging  applications. 


OPTIMAL  BAYES  CLASSIFIER 


The  optimal  classifier  considered  here  is  a  test  of  the  multiple  hypotheses  ^,^,...,^4  that  is 
optimal  according  to  the  Bayes  criterion.  An  optimal  Bayes  test  is  one  that  minimizes  the  risk  (i.e., 
the  expected  loss)  in  applying  a  given  decision  rule.  Theoretical  details  and  proof  can  be  found  in  a 
variety  of  texts  (references  1,  20,  and  21).  The  Bayesian  risk  ^.is  defined  as  (reference  20) 

I  I 

S.=  X  X  cijPjPr[*;|*j],  (39) 

i =o  j=0 

where  cy  is  the  cost  of  deciding  hypothesis  9i  when  #j  is  true;  pj  is  PrfiHj],  the  a  priori  probability 
of  Pr[^^l  is  the  probability  that  a  given  decision  rule  chooses  H  when  is  true; 
i,  j  =  0,1,2,.. .,1.  Note  that 

X  Pj  =  1, 

i  =  0 


and 


i 


=  1  p(xl#j)dx, 


(40) 


where  p(xl^)  is  the  conditional  probability  density  function  of  a  measurement  vector  x  given 
Zi  is  a  region  in  the  sample  space  of  x  such  that  H  is  chosen  if  a  realization  of  x  is  contained  in  Z*; 
the  regions  {ZiJg  are  disjoint  and  cover  the  entire  sample  space.  Here  x  instead  of  x(t)  is  used  for 
reasons  given  in  the  next  section.  It  can  be  shown  (reference  20)  that  the  risk  ^.is  minimized  by 
the  following  decision  rule: 
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choose  #kifDk(x)  =  max{D0(x),Di(x),...,DI(x)), 


(41) 


where  D;  is  the  so-called  discriminant  score  defined  as 
I 

Di(x)  =  -X  Cijpjp(xl^j),  i  =  0,1,2,..., I.  (42) 

j=o 

Note  that  for  the  special  case  of  c»j  =  1  -  5ij,  ^is  the  total  probability  of  error  and  Dj  reduces  to 
D,(x)  =  pip(xl^  +  c,  i  =  0,1,2,...,I,  where  c  is  a  constant  independent  of  i.  In  this  special  case, 
Dj(x)  is  directly  proportional  to  Pr[?flx],  the  a  posteriori  probability  of  the  hypothesis  Pi,  since 

tTtftM  =  ,  .  (43) 

X  P(*W|)P1 
i=0 

The  decision  rule  defined  by  equations  (41)  and  (43)  is  the  so-called  maximum  a  posteriori 
decision  rule.  In  this  report,  the  decision  rule  is  not  restricted  to  this  special  case,  but  preserved  in 
the  more  general  form  of  the  decision  rule  of  equation  (42). 

Now,  observe  that  equation  (42)  can  be  written  as  follows: 

I 

Di(x)  =  p(xl^))X  XijAi(x),  (44) 

j-i 

where 


Ai(x) 


p(xlflp 

p(xl^)  ’ 


which  is  the  likelihood  ratio  of  Pi  with  respect  to  Pi)  for  i  =  1,2,..., I.  Note  that  Ao  =  1  and  Xy  is 
defined  by 

Xij  =■  — CijPj.  id*  0,1,2 . 1. 


Since  the  factor  p(xl^)  is  common  to  ail  Dj(x),  an  equivalent  decision  rule  is  given  by 
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choose  ^4 if  Ek(x)  =  max{E0(x),E1(x) . E^x)}, 


(45) 


where  Ei(x)  is  defined  by 
1 

Ei(x)  =  X  ^ijAj(x),  i  =  0,1,.. .,1.  (46) 

j=Q 

Note  that  while  the  decision  statistic  E,(x)  is  a  linear  function  of  the  likelihood  ratios  { Ai(x)}g  it  is 
not  in  general  a  linear  function  of  the  data  x.  The  functional  dependency  on  x  obtains,  of  course, 
from  the  specific  distributional  form  assumed  for  the  likelihood  functions  p(xl^O,  i  =  0,1,2,...,I. 
The  linear  dependency  of  E;(x)  on  the  set  of  constants  {Xij}g,  which  in  this  report  are  called 
classifier  weights,  is  convenient.  Of  course,  strict  adherence  to  the  Bayesian  formalism  requires 
values  for  each  cost  c;j  and  each  a  priori  probability  pj  to  compute  the  corresponding  weight 
Specification  of  these  values  is  usually  impractical;  although,  assuming  equal  a  priori  probabilities, 
i.e.,  assuming  pj  =  1/(1  +  1),  j  =  0,1, 2,... I,  may  be  reasonable.  Moreover,  the  costs  (however 
“cost”  may  be  defined  in  this  context)  of  correct  decisions  (c^q  may  be  reasonably  set  to  zero. 
Given  these  assumptions, 

•  0  i  =  j 

^ij  =  )  Cij  i,j  =  0.1....I.  (47) 

[  HI  1 

Further  specification  of  the  costs  is  difficult  given  that  a  meaningful  cost  measure  (such  as  time 
spent  searching  by  the  sonar  platform)  can  even  be  defined  in  the  first  place.  It  would  not  be 
reasonable  to  assume,  as  in  the  case  that  leads  to  the  maximum  a  posteriori  decision  rule,  that  all 
errors  have  identical  costs.  For  example,  consider  the  case  1  =  2.  Let  9$  denote  noise  only,  p{\ 
denote  real  target  and  noise,  and  #2  denote  false  target  and  noise;  then  perhaps  C02  =  C20,  but 
certainly  C02  <  C12  or  C21,  for  example. 

These  difficulties  become  compounded  as  I  increases,  since  the  number  of  error  types  is 
1(1+1).  Nonetheless,  the  Bayesian  decision  rule  given  by  equations  (44)  and  (45)  does  give 
structure  to  classifier  design  and  shows  how  the  likelihood  ratios  (Ai(x)}|  figure  in  the  design. 

This  structure,  when  applied  to  experimental  data,  offers  a  method  of  determining  the  coefficients 
(Xjj)  empirically  (given  auxiliary  constraints  like  equation  (47),  for  example).  One  possible 
approach  is  to:  (1)  specify  values  of  the  probabilities  of  each  type  of  error  (except  one);  (2) 
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determine  from  experimental  data  empirical  operating  characteristics  curves  by  iterative  variation  of 
the  coefficients  {Xjj},  selecting  values  corresponding  to  the  assigned  error  probabilities. 

A  theoretical  approach  to  determining  the  coefficients  (Xy)  is  to  apply  a  generalization  of  the 
Neyman-Pearson  Lemma  (references  7,20,  and  21)  to  the  multiple  hypothesis  test  directly  instead 
of  using  the  Bayes  criterion.  However,  this  approach  leads  to  a  decision  rule  equivalent  to  that 
specified  by  equations  (45)  and  (46)  except  that  the  coefficients  {Xjj}  could,  in  principle,  be 
expressed  directly  as  functions  of  the  specified  error  probabilities. 


LIKELIHOOD  RATIO 


REPRESENTATION  OF  x(t) 

Before  proceeding  further,  the  technical  problems  of  a  discrete  representation  of  the  vector¬ 
valued  process  x(t)  must  be  addressed.  This  will  facilitate  development  of  the  optimal  Bayes  test 
as  will  be  shown  in  the  next  section.  An  expansion  is  sought  of  the  form 

M 

x(t)  =  l.i.m.  X  Xm'Pmft),  (48) 

M~»-  m=\ 

where  { xm)  are  (scalar)  random  variables,  {'f'mO))  is  a  set  of  deterministic,  orthonormal  N  x  1 
vector-valued  functions  (complete  over  CN  xfTi.TxI),  and  “l.i.m.”  denotes  limit  in  the  mean  square 
sense.  Equation  (48)  can  be  restated  as 

x(t)  =  l.i.m.  xM(t),  (49) 


where  xm(0  is  defined  by 
M 

*m(0  =  X 

m=l 


(50) 


Since  (vPm(t)}  is  a  set  of  orthonormal  functions,  then 
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T2 

*m  =  J  'P  m  (t)xM(t)dt,  m  =  1,2 . M.  (51) 

Tl 

First  consider  n(t),  a  white  noise  process.  It  can  be  shown  (given  methods  in  reference  5) 
that  its  Karhunen-Loeve  expansion  can  be  made  with  any  complete  set  of  orthonormal 
functions;  i.e.. 


M 

n(t)  =  l.i.m.  2  nm'Fm(t), 

m=l 


(52) 


where 


r>m  — 


T2 


I 


'FlWnWdt, 


(53) 


from  which  it  follows  that,  for  all  m,n  =  1,2,..., 


NQ 


Etnmn J  “  2^mn, 


(54) 


since  Kn(ti,t2)  =  (No/2)8(ti-t2)  I  for  all  ti,t2 in  [Ti,T2]. 

The  next  step  is  to  expand  S;(t)  using  the  set  { vPm(t) } ;  i.e.,  expand  Sj(t)  as 
M 

Si(t)  =  l.i.m.  X  sim  i  =  0,1, 2,... I. 

M“*“  m=l 

An  expansion  is  sought  such  that  only  the  random  sequence  (Sim)  depends  on  i,  where,  as  before, 
the  index  denotes  the  hypothesis  9{.  Note  that  the  set  ('Fm(t)}  is  to  be  independent  of  i,  and 
orthogonality  leads  to 

T2 

Sim  =  J  ^  ^(Os^Odt,  m  =  l,2 .  (55) 

Tl 
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Here  it  is  assumed  that  Sj(t)  is  of  the  form  given  by  equation  (14) 


Si(t)  =  Fi(t)ai, 

where  Fi(t)  is  a  deterministic  (N  x  KJ  matrix,  and  a;  is  a  random  (K  x  1)  vector.  The  matrix  F;(t) 
can  be  expressed  as 

F  i(t)  =  [fii(t).fi2(t) . f»Ki(t)].  (56) 

where  f&lt)  is  an  (N  x  1)  vector  that  is  deterministic  and  can,  therefore,  be  expanded  by  the 
complete  set  {^(t)}: 

M 

fik(t)  =  Km  X  fikm'Fm(t).  k  =  l,2,...,Kif  (57) 

and 

t2 

fikm=  1  'F  m(0fik(t)dt,  m  =  1,2,....  (58) 

Tl 

By  writing  equation  (14)  as 

Si(t)=Iaikfik(t),  (59) 

k=l 

S{(t)  can  then  be  expressed,  using  equation  (57),  as 
M  t  Kj  \ 

Si(0  =  jini  iw  ^ik^ikm  'Fm(0»  (60) 

M— *o»  m_i  ^k=1  J 

which  is  the  desired  expansion  with 
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Sim  =  S  ^ik^ikm*  m  =  1,2,...;  i  =  1,2,. ..,1.  (61) 

k=l 

Now,  x(t)  can  be  expanded  as  in  equation  (48)  where,  under  the  sequence  (xm)  is  given 

by 

Xm  —  Sjm  ■*"  fim>  ®  =  1,2,...,  (62) 

with  the  sequences  {Sim}  and  (nm)  being  defined  by  equations  (61)  and  (53),  respectively.  If  the 
vector  x  is  defined  as  x  =  [xi,x2,...xm]t,  then,  under 

x  =  s;  +  n,  (63) 

where  Sj  and  n  are  similarly  defined  vectors.  Then,  the  following  hold: 

E(xW0  =  0,  (64) 

since  Si(t)  and  n(t)  are  zero-mean  proces.  :s; 

E(xx+I#i)  =  E(s;S;)  +  (No/2)  I,  (65) 

from  the  independence  of  Si(t)  and  n(t)  and  from  equation  (54).  If  equation  (61)  is  written  in 
matrix  form  as 

Si  =  F,ai,  (66) 

where  Fi  is  the  MxK;  matrix  with  components  [FJmjc  =  f^,  then  E^s*)  is  given  by 

E(SjS^)  =  F,KajF;.  (67) 


The  matrix  Fi  can  be  compactly  expressed,  using  equation  (58),  as 


Ti 


Fi  =  J  vF+(t)Fi(t)dt, 


(68) 
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where  the  N  x  M  matrix  'f'(t)  is  defined  by 


¥(t)  =  [vFi(t),‘F2(t),...,^M(t)].  (69) 


In  summary,  x  is  a  truncated  discrete  representation  of  the  process  x(t)  in  terms  of  the 
complete  set  of  orthogonal  vector-valued  functions  {^(t)}: 


Ti 

x  =  j  vFt(t)x(t)dt,  (70) 

7  2 


which  is  equation  (51)  in  matrix  form.  Moreover,  for  all  i  =  0,  1,2,..., I,  Kxi  (defined  as  the 
covariance  matrix  of  x,  i.e.,  Kxi  =  E(xxtli^))  is  given  by 

(71a) 

K,i  -  FjK^Ft  +  ^  I.  (71b) 

It  should  be  pointed  out  that,  in  this  case,  equation  (48)  is  not  the  Karhunen-Loeve  expansion 
of  x(t),  since  the  coefficients  { xm}  are  not  mutually  uncorrelated.  The  vector-valued  functions  need 
only  form  a  complete  (orthogonal)  set;  otherwise,  they  are  arbitrary.  In  general,  the  Karhunen- 
Loeve  expansion  of  x(t)  under  all  does  not  exist  for  I  >  2.  The  bases  for  the  Karhunen-Lo£ve 
expansion  are  the  vector- valued  eigenfunctions  of  the  covariance  matrix  Kxi(t],t2)  that,  of  course, 
vary  with  i;  i.e.,  for  i  =  1,2,..., I, 

Ti 

J  Kxi(t,T)eim(T)dT  =  A.imejn(t),  (72) 

7  2 

for  t  in  [Ti,T2]  and  m  =  1,2,...M.  The  vector-valued  eigenfunctions  {e^t)}  and  the 
corresponding  eigenvalues}^}  are  not,  in  general,  independent  of  i.  Since  the  detection 
problem  is  the  binary  test  (I  =  1),  the  Karhunen-Loeve  expansion  can  be  applied  (references  3  and 
4).  However,  the  classification  problem  is  at  least  ternary  (I  >  2)  and  coefficients  that  are  mutually 
uncorrelated  under  each  cannot,  in  general,  be  found.  At  any  rate,  such  uncorrelated 
coefficients  are  unnecessary  for  the  development  herein. 
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GAUSSIAN  CASE 


If  x(t)  is  a  complex  Gaussian  process  then,  as  is  well  known,  a  linear  functional  like  that  in 
equation  (70)  produces  a  Gaussian  vector  x  with  mean  and  covariance  matrix  given  by  equations 
(64)  and  (71),  respectively.  Explicitly,  the  probability  density  function  of  x  conditioned  on  ^  is 
given  by, 


p(xl#D  = 


exp[-xtKxx],  for  i  =  0,1,2,...,!, 


7tMlKXll 


(73) 


where  IKxil  is  the  determinant  of  Kxi  (and  the  symbol  x  is  used  for  the  variable  in  the  function 
p(-l^)  as  well  for  the  random  (vector)  variable). 

The  likelihood  ratio  in  equation  (44)  is  given  by 

A‘<*>=S  1  ■  O’1-2- 

which  under  the  Gaussian  assumption  becomes 


Ai(x)  =|^i7exp[-xt(Kxi  -  K*0)x]  (74) 

for  i  =  1,2,...,I.  For  the  case  of  continuous  sampling,  that  is,  for  the  process  x(t)  for  all  t  in 
(Ti.Til,  the  likelihood  ratio  is  defined  as  (references  3  and  4) 

Ai[x(t)]  =  l.i.m.  Aj(x),  i  =  1,2,...,I.  (75) 

M-+— 


Therefore,  it  is  now  necessary  to  determine 


l.i.m.  x^K'lx, 
m-*~  x« 


and 


IK 


xnl 


l.i.m. 

|KXl| 


for  i  =  0,1,2,...,!. 
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Assume  that  x(t)  can  be  expanded  in  terms  of  the  complete  orthonormal  (CON)  set  {'FmCt)} 
under  all  i  =  0,1,2,..., I  as  in  equations  (49)  and  (50).  Indeed,  this  has  been  shown  to  be  so,  at 
least  for  the  model  of  x(t)  adopted  herein,  viz.,  equation  (28).  Then,  from  equation  (70), 

T  2  T 2 

Kxi  =  J  J  'Pt(ti)Kxi(ti,t2)'P(t2)dt1dt2.  (76) 

T  l  T! 

Now,  express  K^j  in  the  following  form: 

T2  t2 

K;!=  |  J'ft(ti)Qxi(ti,t2)4'(t2)dtidt2,  (77) 

T 1  7 1 

where  the  (N  x  N)  matrix  function  QXj(£i>t2)  is  to  be  determined.  It  can  be  seen  immediately  from 
equation  (77)  that 

T 2  T2 

xtK;!x=  j  J  x^aOQxKti.t^d^dtidti  (78) 

1  Ti  T  i 

since  from  equation  (50),  written  in  vector  form, 

x.mO)  =  ^(Ox, 

where  the  matrix  'P(t)  is  defined  by  equation  (69).  Then 

t2  t2 

l.i.m.  x'K*lx  =  J  J  xtftOQ^ti.t^xO^dtjd^, 

M—  T  i  T  i 

for  i  =  0,1,2 . 1. 

What  remains  is  to  derive  an  integral  equation  for  QxjOi.t^.  As  it  turns  out,  the  equation  is 
easy  to  solve  for  the  assumed  model  of  x(t)  under  the  alternative  for  all  i  =  0,1, 2,.. ..I.  Multiply 
equations  (76)  and  (77)  to  get 


(79) 


(80) 
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I-J/JJ  'Pt(t,)Kli(t1,t)4'(t)4'»(OQ,i(t',t2)'f (t2)dt,dtdt'dt2, 


(81) 


where  the  integrals  are  each  over  [Ti.TJ  and  I  is  M  x  M  in  this  case.  Next  recognize  that 

M 

S(ti  -  t2)I  =^1  I  =  kUm  ^(ti^tdj),  (82) 

m=l  M-*— 


for  any  ti,t2  in  [Ti.TJ.  Pre-multiplying  equation  (81)  by  H/(tl').  then  post-multiplying  it  by 
H/t(t2'),  and  taking  the  limit  as  M  yields,  invoking  equation  (82), 


5(t'i  - 1'2)  I  =  j  J  {  J  5(t'!  -  t1)KXl(ti,t)5(t  -  OQXl(t'.t2)5(t2  -  t,2)dt1dtdt'dt2,  (83) 

which  reduces  to  (after  dropping  primes),  for  all  i  =  0,1,2 . 1, 

T2 

j  Kxi(ti,t)Qxi(t,t2)dt  =  8(t!  -  t2)I,  Tj  <  ti,t2  £  T2.  (84) 

It  can  now  be  seen  the  QXi(ti,t2)  is  the  inverse  kernel  of  KXl(tltt2)  as  defined  by  the  matrix-integral 
equation  (84).  Clearly,  since  Kx0(ti,t2)  =  (No/2)8(ti  -  t2)I  for  tj,t2  in  [Ti.TJ,  then 

Qx0(ti»t2)  =  j^6(ti  -  t2)L  (85) 


Now,  for  i  =  1,2,..., I.  equation  (84)  becomes,  using  equation  (9)  with 
K„(ti,t2)  =  (N0/2)5(tl  -  t2)I, 

T2 

J  Ksi(ti,t)Qxi(t,t2)dt  +  (No/2)Qxi(ti,t2)  =  5(t,  -  t:;I.  T,  <  ti,t2  £  T2.  (86) 

Ti 

Define  Hi(ti,t2)  by  the  following  equation,  for  i  =  1 .2..  .1: 

Qxi(ti»t2)  =  (2/N0)5(ti  -t2)I-Hi(ti,t2),  T£t|,t:STi  (87) 

Then,  equation  (86)  becomes 


26 


(88) 


T2 

J  Ksi(ti,t)Hj(t,t2)dt  +  Hj(t1>t2)  =  (2/N0)Ks;(ti ,t2). 

Ti 

Equation  (88)  is  the  important  matrix  integral  equation  that  must  be  solved  to  gain  an  explicit 
expression  for  the  likelihood  ratio  functional  of  equation  (75).  Using  equations  (80),  (85),  and 
(87),  it  can  be  seen  that  Aj[x(t)]becomes 

■  t2  t2 

Ai[x(t)]  =  Yoi  exp  /  J  xt(tI)Hi(ti,t2)x(t2)dtidt2 
T 1  T 1 


where 


YOi 


=  l.i.m. 

M  — ><»> 


which  can  be  assumed  for  now  to  exist.  Equations  (88)  and  (89)  define  the  likelihood  ratio  that 
obtains  at  least  for  the  case  when  x(t)  can  be  expanded  in  terms  of  a  CON  set  {'Fm(t)}  that  is 
independent  of  i  =  0,1,2,...,I. 

Equation  (88)  is  now  solved  in  closed  form  for  the  special  case  of  Ksi(ti,t2)  given  by 
equation  (15),  viz., 

KSi(ti,t2)  =  F^tUKaj  Fit(t2),  Ti  <  ti,t2  ^T,  i  =  1,2,...,!. 


It  is  straightforward  to  prove  that  Hi  (t\,t2)  is  given  by 
Hi(t!,t2)  =  (2/N0)2Fi(t1)Hi  F,+(t2), 


(90) 


where  Hi  is  as  in  equation  (35),  viz., 
r  ”>NEr  X 
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This  can  be  shown  by  substituting  equations  (15)  and  (90)  into  equation  (88)  and  solving  the 
resultant  matrix  equation  for  H;.  With  hindsight,  of  course,  it  can  be  shown  that  equations  (35) 
and  (90)  specify  the  solution  by  direct  substitution  into  equation  (88),  given  Ksi(ti,t2)  as  in 
equation  (15).  In  summary,  equation  (89),  the  likelihood  ratio  functional,  becomes 


Aifx(t)]  =  y0iexp 


t2 

J  xt(t1)Fi(ti)HiF1t(t2)x(t2)dtidt2 


(91) 


Equation  (91)  can  be  written  in  several  alternative  forms  as  follows: 

Ai[x(t)]  =  Yo»exp(yil’Hiyi),  (92) 


where 


t2 

Ji-JT  J  F*(t)x(t)dt. 

IN0  Tj 


(93) 


Thus,  y;  is  the  output  of  a  matrix  matched  filter  operation.  Another  form  is 


Ai[x(t)]  =  Yo,expJ 


(94) 


where 


§i(t)  =  Fi(t)a,. 


Under  ?i,  a;  is  the  MVLU  (minimum  variance,  linear,  unbiased)  estimator  of  a;  and  is  given  by 
equation  (37).  Thus,  §,(t)  is  the  optimal  estimator  of  s,(t)  and  equation  (94)  represents  the  familiar 
“estimate  and  match”  structure. 


28 


Example:  The  Ternary  Problem 


Certain  assumptions  can  now  be  made  regarding  the  classifier  decision  statistic  as  previously 
given  by  equation  (46)  rewritten  in  terms  of  the  likelihood  ratio  functional 

I 

Ei[x(t>]  =  X  XijAj(x(t)l,  i  =  0,1,2 . 1.  (95) 

j=o 

First,  let  cu  =  0  and  Pi  =  p  where  0  <  p  <  1  and  i  =  0,1,2 . 1.  By  this,  it  is  assumed  that  each 

hypothesis  is  equally  probable  a  priori  and  that  the  costs  of  correct  decisions  are  zero.  Moreover, 
it  is  assumed  that  the  cost  of  deciding  M,  given  Mq  as  true  is  independent  of  i;  i.e.,  let  Cio  =  Co,  any 
non-negative  constant,  for  all  i  =  1,2,. ..,1.  With  these  assumptions,  equation  (95)  becomes  for  the 
case  of  I  =  2  the  ternary  case: 


EoH  =  X-oiAiH  +  X.02A2H  > 

(96a) 

Ei[-]  =  X-o  +  A.i2A2[-], 

(96b) 

E2H  =  X-o  +  X.21A1H, 

(96c) 

where  Xo  =  -cop.  Using  the  decision  rule  prescribed  by  equations  (45)  and  (46),  the  following 
obtains: 


choose  Mo  if  EiH  <  Eo  [■]  and  E2[  ]  <  EoH;  (97a) 

choose  Mi  if  Ej£*]  >  EoH  and  Ej [*]  >  E2H;  (97b) 

choose  Mi  if  E2[  ]  >  EoH  E2[  ]  >  Ei[  ],  (97c) 

In  the  above  comparisons,  Ej[]  >  Eo[  ],  i  =  1,2, ...,I  are  tests  for  detection  of  the  corresponding 
signals  s,(t).  On  the  other  hand,  the  comparison  EiH  >  E2[-]  represents  the  classification  of  the 
data  x(t)  given  that  both  signals  have  been  correctly  detected;  it  is  this  test  that  discriminates 
between  the  alternative  target  classes.  By  using  equations  (96b)  and  (96c),  the  classifier  rule 
becomes  (remembering  that  Xn,^2i  <  0): 
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choose  Mi  if 


Ml 

a2[] 


X,i2 

>  ~ — ,  otherwise,  choose  fy. 
Mi 


(98) 


Taking  the  logarithm,  as  usual,  and  using  equation  (89),  inequality  (98)  becomes: 


xf(ti)[H1(ti,t2)  -  H2(ti,t2)]x(t2)dtidt2  >  / 12, 


(99) 


where  / 12  =  In  (Xi2Yo2A2iYoi)-  For  the  special  case  when  Hi(ti,t2)  is  as  specified  by  equations 
(35)  and  (90),  the  above  becomes,  defining  L  i2  as  the  logarithm  of  the  likelihood  ratio. 


£  i2  =  yf(Hi  -H2)y  >/ 12, 


(100) 


where  the  additional  assumption  is  made  that  F;(t)  =  F(t)  for  i  =  1,2,  and  y  is  defined  by 


y 


T2 

^  ^  F+(t)x(t)dt. 


This  new  assumption  implies  that  for  each  alternative  hypothesis  (signal  class),  the  time  delay  set 
can  now  be  written  as  {Tnk},  i.e.,  independently  of  the  index  i.  It  is  not  an  overly  restrictive 
assumption,  as  it  turns  out.  There  is  flexibility  in  defining  the  common  test  region  (as  previously 
defined  and  illustrated  in  figure  1),  since  it  is  not  required  for  Kai  to  have  full  rank. 

At  this  juncture,  the  distribution  of  the  test  statistic  Li2,  under  or  #2,  can  be  derived, 
invoking  the  Gaussian  assumption  and  certain  other  assumptions.  The  classifier  performance  can 
then  be  calculated  as  a  function  of  the  critical  parameters  describing  both  the  system  (array,  signal 
design,  etc.)  and  the  set  of  scattering  objects.  The  performance  results,  viz.,  operating 
characteristic  curves,  will  be  presented  in  a  later  section.  But  first,  properties  of  the  estimator  £j 
will  be  studied  further.  As  shown  by  equation  (94)  estimation  of  and  its  error  variance  matrix 
(equation  38)  will  clarify  certain  of  the  classifier's  properties  such  as  resolution. 
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VARIANCE  OF  a*  AND  RESOLUTION 


The  MVLU  (under  estimator  £»i  can  be  written  as,  from  equations  (35),  (37),  and  (93), 

aj  =  H;y;,  i=l,2,...,I.  (101) 

Note  that,  given  the  definition  of  y;,  the  following  holds: 

yi  =  PnO^  +  vj,  (102) 

where  the  noise  term  vj  is  now  given  by 
T2 

Vi  =  J  F  |  (t)n(t)dt. 

Ti 


Equation  (102)  is  characteristic  of  a  class  of  so-called  inverse  problems  where,  in  this  case,  the 
inverse  operator  is  Hj.  Observe  that  when  either  pN  00  or  K^1  — >  0,H;  approaches  O"1;  i.e., 
when  a  priori  knowledge  is  either  unnecessary  because  of  high  signal-to-noise  ratio  or  is 
altogether  absent,  a,  is  simply  the  maximum  likelihood  csnmaior* 

3i  =  —  Of  Vi.  (103) 

Pn 

In  this  case,  the  inverse  operator,  while  theoretically  nonsingular,  may  be  ill-conditioned. 

Recall  that  from  the  definition  of  O;  (equation  ( 19  n. 

1  N  ? 

[^ijjk  ~  jsjgf  J  —  ^inj)f(t  —  ^ink)dt.  J.k.  =  1.2,...,Ki,  (104a) 

*  n=I  Ti 


or,  using  Parseval's  theorem. 


* 


0  in  the  sense  that  the  minimum  eigenvalue  of  K„  -♦  - 
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(104b) 


[^iljk  =  J  |f(C0)P  I  exp[-i(0(Tinlc-  Xinj)] 


n=l 


where  ffca)  denotes  the  Fourier  transform  of  f(t).  (To  economize  on  notation,  the  meaning  of  f(-) 
changes  implicitly  here  depending  on  whether  t  or  co  appears  in  the  argument) 


From  equation  (104b)  it  is  clear  that,  as  resolution  increases,  i.e.,  as  the  differences 
{Xinj  -  tink)  decrease,  the  elements  of  O;  approach  unity,  and  the  matrix  becomes  ill-conditioned. 
When  the  positive  definite,  Hermitian  matrix  d>;  is  ill-conditioned,  c(Oj),  its  so-called  condition 
number,  is  large;  c(<t>i)  is  defined  by  c(Oj)  =  HOjll  /  ll<Dflll,  where  INI  is  any  matrix  norm  (references 
22  and  23).  For  the  2-norm,  c(<J>;)  =  IIOjll2  /  HOi'1ll2  =  XacAii.  where  A.*  and  A.^  are  the  maximum 
and  minimum,  respectively,  eigenvalues  of  (fy.  Consider  further  the  maximum  likelihood 
estimator  given  by  equation  (103)  by  examining  the  sensitivity  of  a,  to  pertrubations  in  both  <I>i  and 
yi.  Let  5aj  and  5y;  be  arbitrary  perturbations  of  a;  and  y;,  respectively,  and  suppose  that  the 
perturbation  5C>i  of  <D;  is  '•mall  enough  that  a  <  1,  where  a  =  IM**1  6d>ill.  Then,  the  following 
well-known  inequ^r.  vreference  23)  holds  (for  any  vector  norm  and  consistent  matrix  norm): 


)l5<D;in 

llOill 


(105) 


Equation  (105)  reveals  the  decreased  sensitivity  of  a;  to  perturbations  both  of  the  data  yx  (viz., 
measurement  noise)  and  of  the  matrix  O,  as  c(0-,)  decreases.  The  perturbation  of  Oj  may  represent 
uncertain  variations  in  the  array  geometry  and  test  region  (e.g.,  the  set  {x^b, }).  However,  the 
introduction  of  Ka;  has,  in  addition  to  the  Bayesian  statistical  interpretation,  the  algebraic  effect  of 
conditioning  the  inverse  Hi  =  [K’|  +  PN^i]  which  can  be  thus  written  when  K^1  exists.  Note 
that  for  actual  computation,  the  form  of  Hj  given  by  equation  (35)  is  used  in  case  Kai  is  singular. 
These  effects  will  be  illustrated  in  later  examples. 


It  is  significant  that  the  estimation  problem  (or  the  inverse  problem  as  just  discussed)  can  be 
physically  interpretated  as  an  acoustic  imaging  process.  From  equation  (38),  the  variance  of  the 
estimation  error  is,  as  given  previously, 

var(ail^)  =  Hi  i  =  1,2,...,I, 


where  equation  (35)  again  is 
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Hj-K^I  +  p^K.J 


i  =  1,2 . 1; 


and  O;  t  given  by  equation  (19),  can  be  rewritten,  using  Parseval's  theorem  as 


<Di  = 


1 

NEf 


j  F  {(co)Fj(co)— , 

-  oo  2  TC 


(106) 


where  Fi(co)  denotes  the  Fourier  transform  of  Fi(t).  Since  the  elements  of  Fi(t)  are  given  by 
[FilOlnk  =  f(t-  link),  for  n  =  1,2 . N  and  k  =  1,2 . Kj,  then  equation  (106)  becomes 


<t>i=  J  !f(0))|2pf (0)^(0))^, 

— oo  2tc 


(107) 


where  f(co),  as  before,  is  the  Fourier  transform  of  f(t)  and  the  N  x  K,  matrix  Pi(co)  is  defined  by 

[Pi(o>)]nk  *  exp(-icotink),  n  =  1,2,...,N;  k=l,2 . K.  (108) 

(Equation  (107)  is  simply  equation  (104b)  with  matrix  notation.)  The  matrix  Pi(to)  completely 
defines  the  geometry  of  the  receiving  array  of  point  transducer  elements  and  the  geometry  of  the 
test  region,  as  previously  defined,  for  the  i1*1  object.  Of  course,  it  is  actually  the  set  of  time  delays 
{^inkl ! 1  that  describes  the  geometry,  and  these  delays  are  further  specified  as  follows.  Referring 
to  figure  2,  one  can  see  that  for  i  =  1,2,. ..,1 

ctink  =  Hrucll  +  Hrik  -  dnll,  k  =  1,2 . Kit  n  =  1,2 . N,  (109) 

where  is  the  position  vector  fixing  the  origin  of  the  k1*1  cell  of  the  i1*1  object's  test  region;  and  dn 
is  the  position  vector  for  the  n^  element  in  the  receiving  array.  A  nonreffactive  medium  with 
sound  speed  c  is  assumed.  A  Taylor  series  expansion  in  terms  of  dn  about  0  on  the  right  side  of 
equation  (109)  gives,  showing  only  first  order  terms, 

C'Eink  =  2llrikll  —  Ct  jjjd,,  +  ...,  (110) 


33 


where  =  rfr/llr^ll,  the  unit  vector  along  r^.  To  neglect  higher  order  terms  is  to  assume  that 
each  of  the  IQ  cells  is  in  the  Fraunhofer  zone  of  the  array.  Note  that  the  second  order  terms 
correspond  to  the  Fresnel  zone  approximation.  Here  it  is  assumed  that  for  the  array  dimensions 
and  distances  of  interest,  the  Fraunhofer  approximation  applies  and  equation  (110)  without  the 
higher  order  terms  is  taken  to  be  exact .  In  this  case,  <2>i  becomes 

J*  |lf(“)|2exp[i(D(tij-tik)]Z  exp[-ito(aij  -  aik)Tdn/c]|^  (111) 

2 

where  tik  =  - llrikll,  j,k,  =  1,2,..., IQ  and  n=l,2,...,N. 

v 


Figure  2.  Nomenclature  for  Nonreffactive  Medium 


Example:  Two-dimensional  case  and  linear  array. 

Consider  the  case  where  =  [sinOk,  cos9k,0]T,  k  =  1,2,...,K.  (The  subscript  i  can  be 
temporarily  dropped  for  this  example.)  For  a  uniform  linear  array,  dn  =  [(n  -  l)d,  0,0]T. 
n  =  1,2,. ..,N,  as  shown  in  figure  3. 
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Further  assume  in  this  example  that  all  cells  are  within  a  single  range  annulus,  i.e.,  assume  that 
Tk  =  Ti  for  ail  k.  Equation  (111)  then  becomes 


where  uj*  =  sinQj  -  sin©*,  j,k  =  1,2,...,K.  Now  let  the  cells  be  uniform  in  angular  extent;  i.e.,  let 
©k  =  (k  -  1)A0,  k  =  1,2,...,K.  The  values  of  A©  specify  the  angular  resolution  of  the  processor 
(equation  (103))  with  0  given  by  equation  (1 12)).  To  proceed  with  calculations,  let  Ka  be  given 
by 

Ka  =  Cal.  (113) 


Figure  3.  Example:  Two-dimensional  t'u>c  und  Linear  Array 
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The  example  represents  the  case  of  scattering  strength  uniformly  distributed,  within  a  range 
annulus,  among  K  sectors  of  angular  extent  Ac.  The  variance  of  a,  var(a),  is 


var(a)  =  H  =  [1C;1  +  pN<J>]-‘, 


(114) 


which,  for  this  example,  becomes 
var(a)  =  oJ[I  +  Pn^O]"1. 


(115) 


The  quantity  of  interest  here  is  termed  the  variance  ratio,  the  trace  of  the  error  variance  matrix 
normalized  by  the  trace  of  Ka,  and  is  given  by,  for  this  example, 

2i^=iff[i+PNOl2<i.r‘ ,  (ns) 

K°a 

where  O  is  given  by  equation  (112).  The  variance  ratio  provides  a  means  of  calculating  a  measure  of 
the  variance  reduction  achieved  by  the  processor  (i.e.,  a,  the  estimator  of  a)  as  a  function  of 
important  parameters  such  as  Pn<^  (received  signal-to-noise  ratio)  and  the  resolution  parameter  A0. 
For  the  example,  the  variance  ratio  is  calculated  (see  figure  4)  for  N  =  8,  co0d/c  =  7t,  K  =  2,3,8 
(angle  cells  each  of  width  A0).  It  is  assumed  that  lf(Q))l2  is  narrowband  at  center  frequency  oh  and  is 
approximated  by  Ef6(co  -  co0)  in  equation  (113).  The  results  of  these  calculations  are  shown  in 
figure  5.  Note  that  for  a  uniform  linear  array,  the  beamwidth  BW,  as  measured  between  the  first 
nulls,  is  given  by,  for  center  wavelength  Xo, 

BW  =  2sin-'(^)  ,  (117) 

and  is,  for  this  example  BW  =  29*.  The  curves  in  figure  5  clearly  illustrate  the  tradeoff  between 
resolution  and  error  variance;  as  the  processor  attempts  greater  angular  resolution,  i.e.,  as  A0 
decreases,  the  error  variance  increases.  The  tradeoff  is  more  pronounced  as  the  received  signal-to- 
noise  ratio  pN^a  increases  and  as  the  number  of  cells  increases.  To  precisely  specify  angular 
resolution  in  terms  of  A0,  the  variance  ratio  tr[var  (a)]/tr[KJ  and  the  number  of  cells  must  be  first 
specified.  Corresponding  values  of  A0  as  a  function  of  PN^a  or' rnore  practically,  increments  of 
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Figure  4.  Test  Regions  for  Example:  Two-dimensional  Linear  Array 


PN<Ja  can  be  determined.  For  example,  referring  to  figures  6  and  7,  if  a  variance  ratio  of 

10“2  (-20  dB)  is  specified,  then  for  K  =  2  cells  a  change  in  resolution  from  A9  =  BW/2  (=  14.5*) 

to  A0  =  BW/8  (=  3.62*),  a  fourfold  improvement,  requires  an  increase  in  pNCa  of  about  6  dB. 

2 

However,  to  acheive  this  same  improvement,  but  for  K  =  8  cells,  ps<Ja  must  be  increased  by  about 
60  dB.  This  example  reveals  the  strong  dependence  of  resolution  on  the  number  of  cells  to  be 
resolved.  Note  that  in  both  cases,  K  =  2  and  K  =  8,  the  curve  for  A0  =  BW/2  (=  14.5*)  nearly 

coincides  with  the  curve  corresponding  to  G>  =  I,  which  represents  so-called  ideal  resolution  and  is 

independent  of  K;  in  this  case,  the  variance  ratio  is  given  simply  by 

2  •  (118) 

l+pNaa 

Further  insight  into  the  resolving  power  of  the  combination  of  a  particular  array  and 
waveform  is  gained  by  an  eigenanalysis  of  the  correlation  matrix  <X>.  In  this  example,  O  is  a 
K  x  K  matrix  with  elements  given  by  equation  (112).  In  general,  O  is  Hermitian  and  positive 
definite,  and  thus  has  spectral  decomposition  given  by 

K 

0  =  X  >-keke’,  (119) 

k=l 


where  Xj  >  \i  >  ...>  Xk  are  the  ordered,  real  positive  eigenvalues  of  Oj,  and  { }  are  the 
corresponding  eigenvectors  for  n=l,2,...,K. 
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10log[trl(var(a)|/tr|Ka|]  ^  10  log[tr|(var(a)j/tr|Ka|] 


EIGHT  ELEMENT  ARRAY 
TWO  ANGLE  CELLS 


-0- 

0  =  1 

BW/2  =  14.5° 

H- 

3BW/8  =  10.9  0 

-0- 

BW/4  =  7.25  0 

BW/8  =  3.62  0 

-D-  BW/16=  1.81  0 

gure  6.  Variance  Ratio  tr[var  (a)]/tr[KJ  vs  Angle  Cell  Width  A0  and  Received 
Signai-to-Noise  Ratio  pjs[Ca2  (Two  Angle  Cells) 


EIGHT  ELEMENT  ARRAY 
EIGHT  ANGLE  CELLS 


-O-  (D  =  I 
-+~  BW/2  =  14.5  0 
h-  3BW/8  =  10.9  0 
BW/4  =  7.25  0 
BW/8  =  3.62  0 
-t>  BW/16=  1.81  0 


101og[PNaa! 

Figure  7.  Variance  Ratio  trjvar  (a))/tr[Kaj  vs  Angle  Ceil  Width  A9  and  Received 
Signal- to- Noise  Ratio  pNcra2  (Eight  Angle  Cells) 


Figure  8  shows  X^i  versus  k  and  A9  for  the  narrowband  case  (lf(co)2 1  =  EfS(co  -  Oo)).  As 
A0  decreases,  the  condition  number  C[d>]  =  XiAk  increases  rapidly  and  O  becomes  sensitive  to 
noise  (and  other  perturbations  as  shown  in  reference  22).  However,  as  shown  previously  in  this 
example,  the  introduction  of  a  priori  information  in  the  form  of  a  Ka  can,  with  sufficient  received 
signal-to-noise  ratio  Pn<^,  reduce  the  variance  of  a,  thereby  compensating  for  an  ill-conditioned 
<D. 


EIGENVALUE  INDEX  (k) 


Figure  8.  Eigenvalue  Spread  for  Narrowband  d> 


Up  to  now,  the  waveform  in  this  example  has  been  assumed  to  be  narrowband  and 
approximated  by  E{6(co  -  co0).  To  examine  the  effect  of  bandwidth,  assume  that  lf(co)2!  is  given  by 


lf(©)2l  = 


(2jcB)-1,  lo)-co0l  <  7tB, 
0,  lo— co0l  >  JtB, 


(120) 


where  a>o  and  B  are  the  center  frequency  and  bandwidth,  respectively.  Figures  9  through  1 1  show 
(Xjc/Xi  versus  k  and  B/fo  for  various  values  of  A8  (BW/8,BW/4,3BW/8).  In  this  case,  K  =  16  to 
determine  if  increased  bandwidth  can  (in  this  example)  significantly  increase  values  of  A*  for 
k  >  N  =  8.  Note  that  the  rank  of  d>  is  minimal  (N,  K)  for  the  narrowband  approximation  (actually 
single  frequency).  This  is  clear  from  equation  (107),  since  P(co)  is  N  x  K.  For  the  case  at  hand, 
i.e.,  for  N  =  8,  K  =  16  increased  bandwidth  does  not  significantlly  amplify  A*  for  k  >  8,  except  for 
values  of  B/fo  unpractically  large  for  the  kinds  of  sonar  systems  under  consideration.  Of  course, 
this  conclusion  applies  to  this  example  and  is  qualified  accordingly.  Further  examination  of 
equation  (107)  should  produce  generalizations  regarding  the  conditionedness  of  O  as  it  depends  on 
array  and  waveform  design. 
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EIGENVALUE  INDE> 


Figure  9.  Eigenvalue  Spread  for 


CLASSIFIER  PERFORMANCE 


This  section  outlines  the  performance  of  the  optimum  processor  for  the  binary  classification 
problem.  It  briefly  covers  some  of  the  components  of  performance  prediction,  and  then  presents 
selected  results.  Both  the  details  of  classification  performance  prediction  and  a  large  number  of 
performance  examples  are  developed  in  appendix  A. 

Abrief  review  of  the  conditions  of  the  problem  is  given  first.  The  test  region  is  assumed  to 
be  identical  under  each  hypothesis;  each  scattering  coefficient  covariance  matrix  is  assumed  to  have 
full  rank,  and  the  noise  is  assumed  to  be  both  temporally  and  spatially  white.  For  the  binary 
problem  iuere  are  two  hypotheses:  and  Under  these  conditions,  the  sufficient  statistic  for 
deciding  whether  the  received  signal  was  from  object  1  or  2  was  given  in  equation  (100)  (a  number 
of  equations  will  be  repeated  here  for  continuity), 

L  =  yt(Hj  -  H2)y 

where  y  is  given  in  equation  (93), 

T 

y-J-  j  Ft(t)x(t)dt 

*’Oo 

and,  from  equation  (35), 

Hi  =  K,i  [I  +  PN<MC.i]'‘  =  [k;.1  +  PN®]'1,  i  =  1,2. 

Classification  performance  is  a  function  of  the  distribution  of  the  scalar  random  variable  L, 
under  Hi  or  H2.  If  py  is  defined  to  be  the  probability  of  selecting  object  i  when  object  j  occurred, 
then 

Pi  i(tj)  =  Jp(LI*i)dL,  (121a) 

n 
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P12OI)  =  |  p(LI^)dL,  (121b) 

n 

where  p(U:*0  is  the  probability  density  function  of  L  conditioned  on  the  i1*1  hypothesis  (i  =  1,2), 
and  r]  is  the  decision  threshold.  Pi  i  is  a  measurement  of  correct  classification,  while  P12  yields  a 
measure  of  incorrect  classification. 

It  is  shown  in  appendix  B  that  L  is  a  complex  quadratic  form  in  the  Gaussian  vector  y  (note 
that  the  matrix  Hi  -  H2  is  Hermitian  but  not  necessarily  positive  definite ).  The  characteristic 
function  of  quadratic  forms  in  complex  Gaussian  vectors  is  well  known  and  for  the  current 
problem  it  can  be  shown  to  be  (reference  23): 

Mi(jco)  s  E[exp  (jcoL)l =  [det  (Cj(jco))]  ,  i  =  1,2,  (122a) 


where 


Q(jGJ)  =  I-jo)Kyi(Hi  -  H2),  i  =  1,2,  (122b) 

and  det  (•)  is  the  determinant. 

The  probability  density  functions  of  interest  can  then  be  written  as 


p(LI^)=—  J  Mi(jo))exp(-jcoL)dco,  i  =  1,2  .  (123) 

2jc  — 

In  general,  considering  the  complexity  of  the  matrices  Hj  (i  =  1,2)  any  performance 
evaluation  based  on  equations  (122)  would  have  to  be  made  on  a  per  case  basis  by  numerical 
means  and  would  not  necessarily  yield  any  insight  into  the  classification  performance  of  the 
processor  in  general.  If  the  following  simplifying  assumptions  are  made,  some  insight  into  the 
processor  performance  may  be  gained.  First,  assume  that  the  waveform  and  the  array  are  capable 
of  perfect  resolution  in  range  and  angle  (for  the  particular  test  region  assumed).  That  is,  assume 

0  =  1.  (124) 
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Also  assume  that,  under  each  hypothesis,  the  K  scatterers  in  the  test  volume  are  statistically 
independent  so  the  Kai  now  represents  a  discrete  version  of  the  conventional  scattering  function 
(reference  2).  Thus,  Kai  is  a  diagonal  matrix  (i  =  1,2). 

Given  these  assumptions,  it  can  be  shown  that,  under  each  hypothesis,  L  consists  of  the  sum 
of  K  independent  random  variables: 

K 

L-Ilk, 

k  =  1 


where  lk  = 


Jk2 


A 


l+aklpN  l+aklpN 


V 


yl 


J 


(125) 


Here  yk  is  the  k1*1  element  of  the  vector  y,  Pn  is  the  input  signal-to-noise  ratio  Pn  =  2NE/N0,and 
is  the  kth  diagonal  element  of  Kai  (scattering  strength  of  the  k^  cell  under  the  i*  hypothesis). 
Thus,  under  each  of  the  two  hypotheses,  L  is  equal  to  the  sum  and/or  difference  (note  that  the 
bracketed  term  can  be  positive  or  negative)  of  independent,  not  necessarily  identically  distributed, 
exponential  random  variables.  Its  density  functions  p(Ll^d),  i  =  1,  2,  can  be  found  from  equation 
(123)  via  residue  theory.  These  can  be  used  to  solve  for  P;j  (i,j  =  1,2)  of  equations  (121). 

Before  addressing  a  specific  set  of  target  descriptions,  some  additional  points  should  be 
made.  The  test  volume  is  assumed  to  be  identical  for  each  hypothesis,  and  comprises  K  cells  with 
an  independent  specular  scatterer  in  each  cell.  If  under  a  specific  hypothesis  no  target  is  present, 
and  the  scattered  return  is  due  to  reverberation  alone,  all  cells  in  the  test  volume  will  possess 
uniform  scattering  strength.  If,  in  addition  to  reverberation,  a  target  is  present  under  a  specific 
hypothesis,  the  uniform  scattering  strength  of  a  number  of  the  cells  (£  K)  in  the  test  volume  may 
be  replaced  with  new  scattering  strengths;  the  number  of  cells,  their  location,  and  the  magnitude  of 
their  scattering  strengths  will  vary  according  to  the  target.  Thus,  it  is  assumed  that  each  individual 
scatterer  falls  into  one  of  two  categories,  target-like  or  reverberation,  and  the  difference  between 
hypotheses  M \  and  ^  is  due  to  the  number,  iocation,  and  scattering  strength  of  the  target-like 
scatterers.  Thus,  although  their  number  may  be  different,  it  is  assumed  in  these  examples  that 
reverberation  scatterers  are  of  equal  strength  under  both  hypotheses. 
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Because  perfect  resolution  has  been  assumed,  it  is  possible  to  index  the  K  cells  arbitrarily, 
but  identically,  under  both  J{\  and  Thus,  the  following  convention  will  be  adopted:  from  the 
K  test  cells,  find  the  total  J,  such  that  J  <  K  and 

cji  *  aj2’  J  =  (126) 


The  remaining  K  -  J  cells  of  equal  scattering  strength  do  not  contribute  to  equation  (125).  The 
specific  ordering  of  the  J  cells  with  unequal  scattering  strengths  can  be  arbitrary  as  mentioned 
before.  In  light  of  this,  the  intersection  of  reverberation  cells  under  9(\  and  ^  does  not  enter  into 
the  problem;  this  is  also  the  case  for  the  intersection  of  any  target-like  cells  of  equal  strength. 

As  an  example,  assume  that  each  target  is  comprises  of  scatterers  of  constant  strength. 

Under  hypothesis  1,  the  target  consists  of  M  cells,  each  with  a  receive  signal-to-noise  ratio 
(oklpN,  k  =  1,...,M)  of  p.  Under  hypothesis  2,  the  target  comprises  J  >  M  cells,  the  same  M 
cells  as  under  hypothesis  1,  and  J-M  additional  cells,  each  with  a  receive  signal-to-noise  ratio 
(cTj^Pn.  k  =  1,..  J)  of 'F  (see  figure  12  for  an  example  of  the  test  volume  geometry).  In  addition, 
it  is  assumed  that  the  total  target  scattering  strength  under  each  hypothesis  is  equal 
(Mp  =  PF).  Due  to  the  unequal  number  of  target  scatterers,  a  number  J  -  M  of  reverberation 
scatterers,  each  with  a  receive  signal-to-noise  ratio  equal  to  y,  must  be  accounted  for. 

In  figure  13  performance  curves  have  been  plotted  for  some  representative  values  of  p,  y,  y, 
M,  and  J.  Here  M  and  P  are  held  constant,  and  J  is  increased,  thus  \\f  is  decreased.  Reverberation 
is  set  equal  to  a  constant  value  in  each  curve  (y  =  \y/4).  The  results  indicate  that,  for  a  given  P12 
(probability  of  incorrect  classification),  there  is  a  particular  set  of  targets,  characterized  by  some 
M',  P',  and  J',  \/,  for  which  the  spreading  of  the  fixed  total  receive  energies,  M'P',  and  Jty',  make 
them  the  most  easily  distinguished  among  all  of  the  targets  with  constant  total  signal-to-noise 
ratios.  This  result,  the  spreading  of  the  total  fixed  receive  energy  over  a  particular  number  of 
independent  channels  (cells)  for  maximum  performance,  is  an  example  of  the  well-known 
“diversity”  phenomenon  (reference  2). 


49 


Two-dimensional  Test  Volume  Geometry;  K  =  36  Test  Cells; 
rget  Cell  Under  H,;  J  =  2,  10,  20  Target  Cells  Under  H2 
get  cell  locations  under  H,  are  a  subset  of  those  under  H2.) 


EXTENSIONS 


Important  extensions  of  this  work  will  be  considered  for  future  research.  Several  are 
discussed  below. 

MOVING  OBJECTS 

Throughout  this  report,  it  has  been  assumed  that  no  relative  motion  exists  between  the  array 
and  the  scattering  objects.  Consider  now  the  case  of  a  rigid  body  moving  with  constant  velocity; 
that  is,  if  Vfc(t)  is  the  velocity  of  the  cell  corresponding  to  the  i^1  object,  then  for  a  rigid  body 

Vik(t)  =  Vi(t),  k  =  1,2,. ..,Ki.  (127) 

and  for  uniform  motion 

Vi(t)*Vit  T!<t£T2.  (128) 


Figure  14.  Nomenclature  for  Object  Motion 
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As  can  be  seen  in  figure  14,  V;  is  relative  to  a  coordinate  system  with  its  origin  fixed  at  the 
receiving  array's  reference.  Moreover,  the  array  elements  are  assumed  to  be  stationary  within  this 
coordinate  system.  The  signal  scattered  from  i^1  object  and  received  at  the  n^1  array  element  is 
now  given  by 


Ki 

Sin(t)  =  X  aikf[t  -  Tink(t)],  Ti  <  t  <  T2,  (129) 

k=l 


where  the  1^(0  is  defined  as  in  equation  (11)  except  that  it  is  now  a  function  of  time’- 

An  expression  for  t;nk(t)  is  developed  here  for  completeness.  First,  for  notational  simplicity 
suppress  for  now  the  subscripts  and  time  dependence  of  x^t),  denoting  it  simply  by  x.  Then, 

x  =  x'  +  x".  (130) 

where  the  delay  x'  is  the  return  delay  (from  scatterer  to  array  element)  and  is  the  implicit  solution  of 
the  equation 

cx/  =  llrk(t-x,)-dnll;  (131) 


and  the  delay  x"  is  the  delay  from  the  array  reference  0  to  the  scattering  cell  and  is  given  by 


cx"  =  llrk(t-  xOH,  k  =  1,2,...,K;  n  =  1,2,...,N. 


(132) 


(Note  that  the  subscript  i  has  been  suspended  for  now.)  An  approximate  solution  to  the  above 
obtains  as  follows.  Let  V  =  Vrk  +  Vck  where  Vrk  =  (V  •  azotic  and  ak  is,  as  before,  the  unit 
vector  rk/  llrkll  and  rk  =  rk(0).  Thus,  Vrk  and  Vcr  are  the  (orthogonal)  down-range  and  cross¬ 
range,  components  of  Vk  respectively. 

Since 


rk(t)  =  Vt  +  rk,  Ti  <  t  <T2, 


(133) 


* 


Amplitude  scaling  by  (1  - 


dTink(0Vi 
dt  ; 


to  conserve  the  waveform's  energy  is  neglected. 
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then 


rk(t)  =  (rk  +  Vrkt)ak  +  Vckt 


(134) 


where  =  llrkll  and  Vrk  =  IIVrkll. 


Defining  dc*  by  d^  =  dn  -  (a  k-  dn)ak  and  substituting  equation  (134)  into  equation  (131) 


yields 


=  1 1  [rk  —  *  dn  +  Vrk(t  —  X0]ctk  —  dcjvt  +  Vck(t  —  X  )||. 


(135) 


Ignoring  the  cross-range  terms  in  equation  (135),  viz.,  d^  and  Vck,  one  can  make  the  following: 


ex'  =  rk  —  ak  •  dn  +  Vrk(t  - x'). 


(136) 


Solving  for  x' (t)  yields 


t'W  =  (rk  -  ak  •  d„  +  Vrkt). 


(137a) 


Using  equations  (132),  (135)  with  Vck  s  0,  and  (137a),  one  can  express  x"(t)  as 


t"(0  =  (rk  +  ^ak  •  dn  +  Vrkt). 


(137b) 


Then,  from  equations  (130)  and  (137),  it  follows  that  (restoring  the  subscript  where  appropriate 
and  the  subscripts  k,  n  on  x) 


1  ~  ^ink(0  =  Pik  I  2  ^  ^  +  P  ik  ^  n  ’ 


(138) 


where 


o  _  £  ~  Vjrk 
Pik  ‘  c  +  V irk 


(139) 


and,  as  before,  X&  =  2r<Jc. 
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Equation  (139)  is  ,  of  course,  the  familiar  expression  for  the  Doppler  effect  of  temporal 
dilation  or  contraction.  In  the  second  term  on  the  right  side  of  equation  (138),  assume  that  (3^  »  1 
then. 


t —  ^ink(t)  —  Pik(t  —  'Cik  —  ^&ik  ’  dn). 


(140) 


Now,  for  moving  objects,  the  signal  model  becomes,  from  equations  (129)  and  (140), 


Ki 

SinW  =  X  aikf  [Pik(t  -  tink)].  ^  <  t  <  T2, 

k=l 


(141) 


where,  as  before  (see  equation  (110)), 

Tink  =  ^ik  —  ^®ik  ’  tin.  (142) 

Define  the  N  x  matrix  Fi(t;V,)  by 

[Fi(t;Vi)]nk  =  f  [Pik(t  -  link)],  Ti  t  £  T2,  (143) 

where  n  =  1,2,. ..,N,  and  k  =  1,2,...,K,.  Note  that  Fj(t;0)  =  F,(t),  as  previously  defined 
(see  equation  (14)).  In  vector  form,  equation  (141)  is 


Si(t)  =  Fi(t;Vi)ai  . 


(144) 


The  results  previously  derived  for  the  case  of  V;  =  0  (i  =  1 ,2,...,1)  can  now  be  applied  by 
substituting  Fj(t;Vj)  for  Fj(t).  The  matrix  O;  now  becomes  <t>,(  V,),  which  is  defined  by 

t2 

*i(Vi)  =  ^  J  F! (t;Vi)Fi(t;Vi)dt;  (145a) 

and,  of  course,  O,  =  d>i(0).  Again  using  Parsevai’s  theorem.  <t>,(  V,)  can  be  written  as 
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oo 


[*i (Vi)]jk  -  (NEdPijllPikl)'1  J  f*(PljCO)X  expaoXP^-  P^))  ^ 

27C 


(145b) 


n=l 


This  equation  shows  explicitly  how  the  Doppler  effect,  viz  p*,  figures  into  the  spatial 
resolution  (and  ambiguity)  of  the  processor.  This  case  will  be  further  developed  in  a  sequel  to  this 
report.  The  expression  for  Oj  (V^)  can  be  interpreted  in  terms  of  the  synthetic  aperture  effect.  The 
effect  on  resolution  of  the  processor  can  be  characterized  by  examining  the  dependence  of  the 
variance  ratio  (equation  (116))  on  \Ck  (and,  of  course,  the  other  important  parameters  such  as  A0, 
Pn,  etc).  Moreover,  the  eigenanalysis  shown  in  equations  (111)  and  ( 1 12)  can  be  modified  to 
accommodate  the  synthetic  aperture  effect.  Such  analyses  should  reveal  important  interre¬ 
lationships  among  object  scattering  characteristics  (as  modeled  herein),  waveform  and  array 
design,  and  motion  induced  effects  within  the  unifying  framework  of  the  Bayes-optimal  classifier. 

NON-GAUSSIAN  SIGNAL 

To  determine  the  Bayes-optimal  classifier,  x(t)  was  assumed  to  be  a  vector-valued  Gaussian 
stochastic  process.  Recall  that  the  model  for  x(t)  is  given  by  (under  i  =  1,2,. ..,1) 

x(t)  =  Sj(t)  +  n(t), 

Ki 

Sin(0  =  ^  3iijf(t  —  Tinfc),  n  =  1,2,...,N, 

k=l 


and  the  noise  process  n(t)  was  assumed  to  be  white  and  Gaussian.  The  scattering  coefficients 
{afcjwere  not  necessarily  assumed  to  be  Gaussian;  however,  the  sum  Sin(t)  was  assumed  to  be 
Gaussian,  implicitly  invoking  the  central  limit  theorem  as  is  usually  done.  Suppose  that  the 
probability  density  function  (pdf)  of  the  random  vector  a;,  denoted  by  gj  (•),  is  now  introduced 
explicitly  and  that  it  is  not  necessarily  Gaussian.  Since  x  is  the  discrete  representation  of  the 
process  x(t)  as  per  equation  (70),  let  the  conditional  pdf  of  x  given  a,  =  A*  (and  given  !>$)  be 
denoted  by  p(xlA;;  2£),  where  A;  is  some  arbitrary  realization  of  the  random  vector  a;.  Then,  the 
pdf  of  x  given  ^  can  be  expressed  as 

p(xl#Q  =  |  p(xlAi;^)gi(Ai)dAi.  (146) 
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The  likelihood  ratio  in  equation  (44)  is  now 


<1J7) 

or,  as  expected, 

Ai(x)  =  J  AiCxIAOgiCAOdAi,  (148) 

where  AjfxlAj  )is  the  conditional  likelihood  ratio  defined  by 


Ai(xlAi) 


p(xlAj;i^) 
p(xl^j)  • 


By  equations  (63)  and  (66),  under  H  and  given  aj  =  A,, 
x  =  Fi  A;  +  n  , 


(149) 


(150) 


where  n  and  Fj  are  as  previously  defined  (equations  (53)  and  (68),  respectively).  Then, 

Ai(xlAi)  =  P"(p^;Ai)  .  (151) 

where  pn(-)  is  the  pdf  of  n.  It  is  easy  to  show  that  if  p„(-)  is  the  Gaussian  pdf,  then  the 
conditional  likelihood  ratio  depends  on  x  only  through  y*  =  F^K^x,  the  matched  filter/beam- 
former  operation  (with  prewhitening).  This  is  so  regardless  of  the  particular  form  of  gj(-);  of 
course,  the  particular  functional  dependence  of  A*(-)  on  yt  is  determined  by  gj(-),  and  is  not,  in 
general,  quadratic  as  when  a*  is  Gaussian.  Future  work  will  address  alternatives  to  the  Gaussian 
assumption  with  emphasis  on  the  case  of  non-Gaussian  stochastic  signals  and  Gaussian  noise. 
Much  of  the  literature  on  the  non-Gaussian  (detection)  problem  focuses  on  the  case  of  non- 
Gaussian  noise  and  of  a  signal  that  is  either  known  or  stochastic.  However,  the  non-Gaussian 
character  of  the  signal  is  not  explicitly  characterized  in  the  same  way  the  noise  is.  The  pdf  of  the 
signal  is  not  explicitly  introduced;  usually,  only  its  mean  and  covariance  appear,  as  in  the  case  of 
the  locally  optimal  Bayes  detectors  (see  references  25  and  26,  for  general  treatment).  Expansions 
including  third  and  even  higher  order  moments  should  be  investigated,  taking  into  account  the 
important  distinguishing  features  of  the  classification  problem  as  formulated  in  this  report.  This 
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includes  the  possibility  —  a  likely  one  in  some  important  applications  —  that  the  distribution  of  the 
random  signals  will  vary  among  alternative  hypotheses  {9i). 


CONCLUSIONS 

The  active  sonar  classification  problem  can  approached  as  a  likelihood  ratio  test  of  multiple, 
alternative  hypotheses  versus  a  noise-only  null  hypothesis.  The  data  are,  in  general,  vector-valued 
stochastic  processes  representing  measurements  from  individual  elements  within  a  sonar  array. 
Given  an  explicit  form  for  the  received  signal  model,  which  is  statistically  characterized  for  each 
alternative  hypothesis  (target  class),  explicit  results  can  be  derived  for  the  likelihood  ratio  and 
various  performance  characteristics.  Moreover,  the  optimal  processor  can  be  interpreted  as  an 
acoustic  image  processor.  Generalizations  of  the  results  have  been  indicated  and  in  some  cases 
addressed  in  detail  (e.g.,  the  case  of  moving  targets). 
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APPENDIX  A 

A  CONTINUOUS  MODEL  FOR  THE  SIGNAL 


In  this  appendix  an  optimal  estimator  of  the  random  field  a,(r)  is  derived  The  field  ai(r) 
appears  in  the  signal  model  given  by 

Si(t,d)  =  J  aj(r)f(t  -  x(r,d))dr,  (A-l) 

where  Si(t,d)  is  the  signal  at  position  d  and  time  t;  f(t)  is  the  transmitted  waveform;  the  delay  x(r,d) 
for  a  non-refractive  medium  with  sound  speed  c  is  given  by 

x(r,d)  =  r  (Hr  -  dll  +  r).  (A-2) 

v 

The  region  ^  contains  the  i *  object.  Assume  that  s,(t,d)  is  measured  at  N  discrete  sensor 
positions  d„,  n  =  1,2,... ,N,  as  is  usual  in  practice.  Then, 

sin(t)  =  J  aj(r)f(t  -  x(r,dn)dr,  (A-3) 

Si 


Ki 

where  s,n(t)  =  Si(t,dn),  n  =  1,2,...N.  When  a(r)  =  £  aj5(r  —  r^J,  equation  (A-3)  reduces  to  the 

i=l 

discrete  model  of  equation  (11)  with  x^  =  x(rijc,dn).  The  problem  addressed  here  is  the 
estimation  of  the  field  a,(r)  given  the  measurements  x(t),  where  under  the  hypothesis  H 

x(t)  =  Sj(t)  +  n(t),  (A-4) 

where  s(t)  =  [Sii(t),Si2(t),...,SiN(t)]T,  x(t)  =  [xi(t),x2(t),...,xN(t)]T,  and 

n(t)  =  [ni(t),n2(t),...,nN(t)]T,  the  measurement  noise.  (In  the  remainder  of  this  appendix  the 

subscript  i  will  be  deleted.) 

Assume  the  following  about  the  random  field  a(r): 

E[a(r)j  =  0,  (A-5) 


A-l 


and  assume  that  the  covariance  Ka(rhr2)  =  E[a(ri)a*(r2)]  is  known  for  all  ri,r2  e  %.  Regarding 
the  noise,  assume,  for  now,  that  E(n(t))  =  0  for  all  t  e  [Ti,T2]  and  that  Kn(ti,t2)  =  E(n(ti),nt(t2)) 
is  known  for  all  ti,t2  e  [T^TJ;  moreover,  assume  that  E[a(r)n*(t)]  =  0.  Then,  the  following 
relations  hold  for  E[x(t)]  and  Kx(ti,t2)  =  EfrCtOx^)]: 

E[x(t)]  =  0,  (A-6) 


and 


Kx(ti,t2)  =  J  |  Ka(n,r2)f(ti,ri)ft(t2,r2)dridr2  +  Kn(ti,t2),  (A-7) 

x  x 

T 

where  f(t,r)  =  [f(t  -  x  (r,di),f(t  -  't(r,d2),...,f(t  -  t(r,dN)J  •  Now,  let  a(r)  denote  the  best 
(minimum  variance)  linear,  unbiased  estimator  of  a(r)  for  all  r  6  'Rj,  thus, 

E[a(r)]=E[a(r)],  (A-8) 

and  the  variance  Ca(r)  of  the  error  a(r)  -  a(r)  is  given  by 

Ca(r)  =  E[|a(r)  -  a'(r)|2]. 

Since  a(r)  is  a  linear  estimator,  it  is  of  the  form 


a(r)  =  J  hT(r,t)x(t)dt,  (A-9) 

where  the  vector-valued  function  h(r,t)  is  to  be  determined,  and  temporal  integration  is  over 
[Ti.TiJ.  From  the  orthogonal  projection  lemma 

cov[(a(r),  (a(r)  -  a(r))]  =  0;  (A-10) 

or,  from  equations  (A-5)  and  (A-8), 

E(IS(r)l2)  =  E[(a(r)a*(r)].  (A-ll) 


A-2 


Now  from  equation  (A-9), 


E[la(r)|2]  =  J  JhT(rftl)Kx(t1,t2)h*(rft2)dt1dt2f 
and 


(A-12) 


E[a(r)a*(r)]  =  J  E[a(r)xt(t)h*(r,t)]dt.  (A-13) 

Combining  equations  (A-12)  and  (A-13)  according  to  equation  (A-l  1)  yields 

J  (E[a(r)xt(t2)]  -  JhT(r,ti)Kx(ti,t2)dti }  h*(r,t2)dt2  =  0  (A-14) 

identically  for  all  r  €  ^  Therefore,  the  expression  within  (•]  of  equation  (A-14)  must  be 
identically  zero  for  h(r,t)  *  0;  that  is, 

E[a(r)xt(t)J  =  J  hT(r,t')Kx(t,t')dt',  (A-l  5) 

the  Weiner-Hopf  equation. 

Since 

x(t)  =  J  a(r)f(t,r)dr  +  n(t),  (A-l  6) 

at 

then 

E(a(r)xt(t)]  =  J  ft(t,r')K«(r,r,)dr'.  (A-l  7) 

x. 

Substituting  equations  (A-l)  and  (A- 17)  into  the  Weiner-Hopf  equation  gives 


J  f+(t,r')Ka(r,r')dr'  =  J  J  J  h%,Of(t',r')dt'Ka(r',r'')f (t,r")dr'dr' 


J  hT(rft#)K„(t/,t)dt/. 


(A*  18) 


Now,  define  a  function  tfri,r2)  over  ^.x  ^such  that 


hT(r,t)  =  J  Y(r,r')ff(t,r')dr. 

■K 


(A- 19) 


Then,  equation  (A- 18)  becomes 


J  ft(t,r')Ka(r,r')dr'=  J  J  J  J  Y(r,r'")ft(t^^",)f(t,r,)dt,Ka(r/,r")ft(t,r")d^'d^"d^, 


X. 


+ 1  J  7(r,r')f1’(t',r')Kn(t',t)dt/dr' 


(A-20) 


Defining  the  signal  (spatial)  correlation  function  0(ri,r2)  by 


<t>(ri,r2)  =  J  ft(t,r1)f(t,r2)dt 


(A-21) 


makes  equation  (A-20)  become 


J  ft(t,r')Ka(r,r,)dr'  =  J  /  J  y(r,r',0<t)(r'",r')Ka(r,,r")f+(t,r/')dr'dr"dr/ 


x. 


+  ^  J  y(r,r,)ft(t,r')dr', 


(A-22) 


N 

where  it  is  now  assumed  that  Kn(ti,t2)  =  -y6(ti  -  t2)I,  a  white  noise  process.  After  rearranging 
terms,  one  can  write  equation  (A-22)  as 

j  f*Ka(r,r")  -}  J^(r,r/>(r^rOKa(^r/Wdr///-^Y(r,r,oV(t,r")dr//  =  0.  (A-23) 

z  L  * 

Since  f(t,r)  *  0  for  all  t  e  [Ti,T2]  and  re  H,  then  the  expression  within  [•]  in  equation  (A-23)  is 
identically  zero;  that  is, 

J  Y(n,r')  J 0(r',r")Ka(r",r2)dr'dr"  +  ^Y(rt,r2)  =  Ka(ri,r2).  (A-24) 

z  z 

Define  the  function  P(ri,r2)  over  !Rjx  ^.by 

P(ri,r2)=  J  <JKri,r')Ka(r',r2)dr';  (A-25) 

z 

then,  equation  (A-24)  can  be  written  as 

J  Y(ri,r,)P(r',r2)dr'  +  S*  Y(ri,r2)  =  Ka(rbr2).  (A-26) 

z 

In  summary,  the  minimum  variance,  linear,  unbiased  estimator  of  the  random  field  a(r)  is 
given  by,  combining  equations  (A-9)  and  (A- 19), 

a(r)  =  j  y(r,r ') J  f*(t,r')x(t)dtdr',  (A-27) 

z 

where  the  kemal  y(ri,r2)  is  the  solution  to  the  integral  equation  (A-26). 

To  complete  this  treatment  of  the  continuous  model  an  expression  for  the  error  variance  Ca(r) 
=  E[|a(r)  -  a(r)|2]  is  derived.  Now,  Ca(r)  can  be  written  as 

Ca(r)  =  E { a*(r)[a(r)  -  a(r)] }  -E{a*(r)[a(r)  -a(r)]}.  (A-28) 


A-5 


The  second  term  on  the  right  of  equation  (A-28)  vanishes  because  of  the  orthogonal  projection 
lemma,  equation  (A-10).  Next,  by  substituting  equation  (A-16)  into  equation  (A-27),  a(r) 
becomes 


a(r)=  /  /  tfrir'WrVWOdr'dr" 

X.  X. 

+  J  J  Y(r,r')ft(t,r')n(t)dr/dt.  (A-29) 

X. 

From  equation  (A-29),  E{a*(r)[a(r)  — a(r)] }  becomes 

E{a*(r)[a(r)  -  a(r)]}  =  Ka(r,r)  -  J  J y(r,r')0(r',r")Ka(r,,,r)dr,dr".  (A-30) 

X.  X. 

The  second  term  on  the  right  of  equation  (A-29)  is  uncorrelated  with  a*(r)  since  E[(a*(r)n(t)]  =  0. 
The  expression  for  Ca(r)  becomes,  using  the  definition  of  |3(ri,r2)  in  equation  (A-25), 

Ca(r)  =  Ka(r,r)  -  J Y(r,r')(3(r',r)dr/,  (A-31) 

x. 

which  finally  becomes,  using  equation  (A-26), 

Ca(r)=^Y<r,r),  (A-32) 

where,  as  before,  Y(ri-r2)  is  the  solution  to  the  integral  equation  (A-26). 
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APPENDIX  B 

PERFORMANCE  FOR  THE  BINARY  CLASSIFICATION  PROBLEM 

BACKGROUND 

This  appendix  addresses  the  problem  of  determining  the  performance  of  the  optimum 
processor  for  the  binary  classification  problem  described  in  the  main  text.  The  intent  is  to  first 
present  the  details  of  classification  performance  prediction  for  the  general  binary  case.  It  is  shown 
that  analytic  solutions  to  classification  performance  prediction  are  difficult  to  obtain  for  general 
target  geometries  and  array/waveform  configurations;  therefore,  simplifying  assumptions  are  made 
so  that  analytic  solutions  can  be  derived.  These  are  used  to  illustrate  fairly  simple  but  instructive 
cases. 

The  conditions  of  the  problem  can  be  briefly  restated  as  follows:  the  test  region  is  assumed 
to  be  identical  under  each  hypothesis;  each  scattering  coefficient  covariance  matrix  is  assumed  to 
have  full  rank,  and  the  noise  is  assumed  to  be  both  temporally  and  spatially  white.  For  the  binary 
problem  there  are  two  hypotheses;  9{\  and  fy.  Under  these  conditions,  the  sufficient  statistic  for 
deciding  whether  the  received  signal  was  from  object  1  or  2  was  shown  in  equation  (100)  to  be 

L  =  yt(H1-H2)y,  (B-l) 


where 


t2 


y  =  J  F+(t)  x(t)  dt, 


(B-2) 


and 


h, = K„r 


I  +  pN^Kaj 


-,-1 


r^-i 


Ka  +  pnO 


i  =  1,  2. 


(B-3) 
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PERFORMANCE  MEASURE 


Classification  performance  is  a  function  of  the  distribution  of  the  scalar  random  variable  L 
under  X\  or  fy.  If  Py  is  defined  to  be  the  probability  of  selecting  object  i  when  object  j 
occurred,then 


PuCn)  =  Jp(LI*i)dL, 


(B*4a) 


i 

P 22(^1)  =  J  p(LI^2)  dL, 


(B-4b) 


P12OI)  =  J  p(LI^2)  dL, 


(B*5a) 


P2iCH)  =  J  p(LI#i)  dL, 


(B-5b) 


where  p  (Ll^)  is  the  probability  density  function  of  L  conditioned  on  the  i1*1  hypothesis  (i  =  1,  2), 
and  T|  is  the  decision  threshold.  Pn  and  P22  are  measures  of  correct  classification,  while  P12  and 
P21  yield  measures  of  incorrect  classification. 


Let  us  determine  the  density  functions  p(LI?£),  1  =  1.2.  Note  that  y  is  a  linear  functional  of 
the  vector-valued,  complex  Gaussian  process  x(t);  therefore,  it  is  also  a  complex  Gaussian  vector 


E  [yl#i]  =  0  ,  i  =  1,  2, 


(B-6) 
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Kyi  =  E  [y  yt|#j]  =  pj  O  Kai  <D+  +  pN  <D  ,  i  =  1,  2, 


(B-7) 


where  E  [•]  denotes  expectation. 

L  is  therefore  a  complex  quadratic  form  in  the  Gaussian  vector  y  (note  that  the  matrix 
Hi  -  H2  is  Hermetian  but  not  necessarily  positive  definite-).  The  characteristic  function  of 
quadratic  forms  in  complex  Gaussian  vectors  is  well  known  and  for  the  current  problem  it  can  be 
shown  to  be  (reference  23) 

M,(jo))sE  [exp(jcoL)li^]  =  { det  [Ci(jco)]}  ,  i  =  1,2,  (B-8a) 


where 


CiO'co)  =  I  -  jcoKyi  (Hi  -  H2),  i  =  1,2,  (B-8b) 

and  det  [•]  is  the  determinant. 

Moreover,  it  can  be  shown  that  (reference  25) 

K 

MiG©)  =  FI  (1  "  j“Cki)~l .  (B-8c) 

k=l 

where  £ki  is  the  kth  eigenvalue  of  the  matrix  (Hi  -  H2),  Kyi  i  =  1,2.  The  probability  density 
functions  of  interest  can  then  be  written  as 

OO 

p(  UOi )  =  —  J  MiG©)  exp(-jcoL)  dco,  i  =  1,2.  (B-9) 
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Diagonal  Matrices 

In  general,  considering  the  complexity  of  the  matrices  Hj  (i  =  1,2),  any  performance 
evaluation  based  on  equation  (B-8)  would  have  to  be  made  on  a  per  case  basis  by  numerical  means 
and  would  not  necessarily  yield  any  insight  into  the  classification  performance  of  the  processor.  If 
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the  following  simplifying  assumptions  are  made,  some  insight  into  processor  performance  may  be 
gained.  First,  assume  that  the  waveform  and  the  array  are  capable  of  perfect  resolution  in  range 
and  angle  (for  the  particular  test  region  assumed).  That  is,  assume 

®  =  I  (B-10) 


Also,  assume  that  under  each  hypothesis,  the  K  scatterers  in  the  test  volume  are  statistically 
independent  so  that  Kai  now  represents  a  discrete  version  of  the  conventional  scattering  function 
(reference  2).  Thus,  Kai  is  a  diagonal  matrix  (i  =  1,2).  Given  these  assumptions,  Hj,  Kyi,  and 
therefore  Ct(jco)  can  be  shown  to  be  diagonal  matrices  and  equation  (B-8)  can  then  be  written  as 


K 


Mi(jco) = n 


k=i  [C,(j(o)]kk’ 


i«  1,2, 


(B- 11) 


where  [Ci(jco)]kk  is  the  k£h  diagonal  element  of  Ci(jco).  Thus,  under  each  hypothesis,  l  consists 
of  the  sum  of  K  independent  random  variables 


L=  X  lk, 

k=  1 


where 


(  <4 


lk* 


<4  ^ 


1+CT? 


V 


klpN  1+C1c2Pn 


2 

yk 


J 


Here  Pn  is  the  input  signal -to- noise  ratio  =  2NEf/N0,  and  is  the  kth  diagonal  element  of  Kai 
(scattering  strength  of  the  k£h  cell  under  the  i1^  hypothesis). 

The  density  functions  for  lk  (k  =  1,2,...,K)  will  now  be  determined.  If  a  generalized  receive 

2  2 

signal-to-noise  ratio  did  =  PN°ki  =  2  Oki  NEf/N0  is  defined  for  each  cell  then 


1  1 
[Ci(jW)]kk  -  (l-jcoctki)’ 


(B-12a) 


where 
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(B- 12b) 


CCkl  = 


dki-<ik2 

dk2+l 


and 


1  _  1 
[C2(jO))]kk  (l-jcoctk2) 


(B-13a) 


where 


«k2  = 


dki-dk2 

dki+1 


I 

(B-13b)  j 


Therefore,  each  of  the  K  components  1*  of  L  is  distributed  as  follows: 


oo 


=  0, 


-k-  <  0. 

<*ki 


(B-14) 


Under  each  of  the  two  hypotheses,  L  is  equal  to  the  sum  and/or  difference  (note  aid  can  be 
positive  or  negative)  of  independent,  not  necessarily  identically  distributed,  exponential  random 
variables.  Its  density  functions  p(LI ^  (i  =  1,2)  can  be  found  from  equation  (B-9)  via  residue 
theory.  These  functions  can  be  used  to  solve  for  P;j  (i,j  =  1,  2)  of  equations  (B-4)  and  (B-5). 

Some  examples  will  now  be  investigated  by  specifying  values  for  the  scattering  strengths  aj^ 
or,  equivalently,  the  generalized  receive  signal-to-noise  ratios  d^  (k  =  1,2 . K  and  i  =  1,  2). 


PERFORMANCE  EXAMPLES 


Before  addressing  specific  target  descriptions,  some  additional  points  should  be  made.  The 
test  volume  is  assumed  to  be  identical  for  each  hypothesis,  and  it  comprises  K  cells  with  an 
independent  specular  scatterer  in  each  cell.  If  under  a  specific  hypothesis  no  target  is  present,  and 
the  scattered  return  is  due  to  reverberation  alone,  all  cells  in  the  test  volume  will  posseses  uniform 
scattering  strength.  If,  in  addition  to  reverberation,  a  target  is  present  under  a  specific  hypothesis, 
the  uniform  scattering  strength  of  a  number  of  the  cells  (<  K)  in  the  test  volume  may  be  replaced 
with  new  scattering  strengths;  the  number  of  cells,  their  location,  and  the  magnitude  of  their 
scattering  strengths  will  vary  according  to  the  target.  Thus,  it  is  assumed  that  each  individual 
scatterer  falls  into  one  of  two  categories:  target-like  or  reverberation,  and  the  difference  between 
hypotheses  and  is  due  to  the  number,  location,  and  scattering  strength  of  the  target-like 
scatterers.  Thus,  although  their  number  may  be  different,  it  is  assumed  in  these  examples  that 
reverberation  scatterers  are  of  equal  strength  under  both  hypotheses. 

Because  perfect  resolution  has  been  assumed,  it  is  possible  to  index  the  K  cells  arbitrarily, 
but  identically,  under  both  and  Thus,  the  following  convention  will  be  adopted:  from  the 
K  test  cells,  find  the  total  J  such  that  J  <  K  and, 

dji*dj2,  j  =  1,  2, ....  J.  (B-15) 

fhe  remaining  K  -  J  cells  of  equal  scattering  strength  do  not  contribute  to  either  equation  (B-12)  or 
tB- 13)  and,  therefore,  have  no  effect  on  the  characteristic  function  in  equation  (B-l  1).  The 
specific  ordering  of  the  J  cells  with  unequal  scattering  strengths  can  be  arbitrary  as  mentioned 
lefore.  In  light  of  this,  the  intersection  of  reverberation  cells  under  and  does  not  enter  into 
the  problem;  this  is  also  the  case  for  the  intersection  of  any  target-like  cells  of  equal  strength. 

Two  examples  will  now  be  examined.  See  table  B-l  for  a  summary  of  the  target 
descriptions. 
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Table  B-l.  Target  Descriptions 


*2 

Comments 

Case  1.  J  target  cells 
dki  =  P,  k  =  1,2,  ....  J 

K  -  J  reverberation  cells 
dki=Y,k  =  J+  1,  ...,K 

J  target  cells 

dk2  =  k  =  1,  2,  ...  J 

K  -  J  reverberation  cells 

dk2  =  Y.  k  =  J  +  1,  ...  K 

Assume  [3  >  *F. 

Targets  collocated  under  tf\, 

K  -  J  reverberation  cells  do  not 
affect  performance 

Case  2.  M  <  J  target  cells 
dkl  =  p,k  =  1,2,  ....  M 

K  -  M  reverberation  cells 

dki=Y,k  =  J+l . K 

J  target  cells 

dk2  =  ^,k=  1,2,  ...  J 

K  -  J  reverberation  cells 

dki  =  Y-  k  =  J  +  1 . K 

Assume  Mp  =  TF  and  y  <  'F  <  p. 

K  -  J  reverberation  cells  do  not 

affect  performance,  but  J  -  M  +  1 

reverberation  cells  do. 

Cass-L 


The  target  is  located  in  the  exact  same  J  ceils  under  each  hypothesis,  and  each  target  is 
assumed  to  have  uniform  scattering  strength  (see  figure  B- 1  a  for  an  example  of  the  test  volume 
geometry).  Under  these  conditions,  performance  is  a  function  of  the  number  of  target  cells  J  and 
their  relative  strengths  (3  and  y;  the  remaining  K  -  J  cells  that  correspond  to  reverberation  do  not 
affect  performance  (see  equations  (B-l 2b)  and  (B-l  3b) >. 
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Figure  B- 1 


REVERBERATION  CELL  (7) 


|U  TARGET  CELL  H,  (p) 


TARGET  CELL  H2 


OF) 


Example  of  a  Two-dimensional  Test  Volume  for  Case  1;  K  =  36  Test  Cells; 

J  =  10  Target  Cells  (Note:  Target  cell  locations  identical  under  each  hypothesis.) 


AZIMUTH 


AZIMUTH 

REVERBERATION  CELL  (y) 


TARGET  CELL  Ht  (J3) 
TARGET  CELL  H2  OF) 


Figure  B-  lb.  Example  of  a  Two-dimensional  Test  Volume  for  Case  1;  K  =  36  Test  Cells 
M  =  1  Target  Cell  Under  Ht;  J  =  10  Target  Cells  Under  H2  (Note:  Target 
cell  locations  under  H,  are  a  subset  of  those  under  H2.) 


P-y 

OCkl  =  — 7  s  K  , 

y+l 


P-y 

«k2=  1  = 

P+1 


k  =  1.  2 . J, 


k  =  J+  1,  J  +  2 . K, 


k  =  1,  2,  ...,  J, 


(B-16a) 


(B-16b) 


k  =  J  +  i,  J  +  2 . K. 


Since  P  >  \j/,  k  and  v  are  both  positive.  Thus,  under  each  hypothesis,  L  follows  a  gamma 
distribution: 


p<us4),Si"'('«)' 


for  L  >  0, 


for  L  <  0. 


P<Ll^)=^Si“P(-v 


for  L  >  0. 


for  L  <  0 


This  yields  the  following  expressions  for  classifier  performance: 


(B-17a) 


(B-17b) 


PnCDW-1)!  r?\. 

I  KJk=0  k!Kk 


(B-18a) 


Pu(n)  =  expf  -  -]  X 


rjVv  rlk  f  ( i  ♦  k-  »  v  nkU+K)k 

-  L  7~z  =  exP|- -  1  r7T~’ 

v  J k=o  k!vk  I  k  i.o  k!xk 


(B-18b) 


since  v  =  k/(1+k)  . 
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Figure  B-2  shows  the  performance  curve  (Pu  versus  P12)  for  some  representative  values  of 
(3,  y,  and  J.  In  this  example,  both  the  scattering  strength  ratio  (P/y)  and  total  target  receive  signal  - 
to-noise  ratios  (J(3  and  Jy)  are  held  constant  as  the  number  of  cells  encompassed  by  the  target  is 
increased;  therefore,  the  individual  cell  receive  signal-to-noise  ratios  (P  and  y)  decrease.  Each 
target  in  this  example  can  be  thought  of  as  belonging  to  a  class  of  targets  characterized  by  constant 
target  receive  signal-to-noise  ratio  (jp  or  Jy). 

The  results  indicate  that,  for  a  given  P12  (probability  of  incon-ect  classification),  there  is  a 
particular  set  of  targets  (characterized  by  J',  p'  and  J',  yO,  taken  from  the  two  classes,  that  are 
most  easily  distinguished  from  each  other.  This  is  also  represented  in  figure  B-3,  where  Pn 
(probability  of  correct  classification)  is  plotted  as  a  function  of  the  number  of  cells  J  for  a  fixed 
P12.  A  family  a  curves,  parameterized  by  fixed  scattering  strength  ratio  P/y  is  shown.  This  plot 
illustrates  that  there  is  a  partitioning  of  the  independent  target  scattering  strengths  into  a  particular 
number  of  cells,  for  which  performance  is  best.  This  result,  spreading  of  the  total  fixed  receive 
energy  over  a  particular  number  of  independent  channels  (target  cells)  for  maximum  performance, 
is  an  example  of  the  well-known  “diversity”  phenomenon  (reference  2).  Of  course,  in  the  present 
example,  the  system  designer  has  no  control  over  the  target  physics  so  that  the  result  indicates 
only  which  set  of  targets  within  the  class  are  most  easily  distinguished  from  each  other. 

As  a  second  example  of  Case  1,  let  the  number  of  target  cells  under  each  hypothesis  J  and  the 
total  target  receive  signal-to-noise  ratio  under  Jy  remain  fixed.  Let  the  individual  cell  receive 
signal-to-noise  ratio  under  "H\,  P  increase.  As  in  the  first  example,  the  remaining  K  -  J 
reverberation  cells  do  not  affect  performance. 

As  expected,  performance  improves  with  increased  p.  The  results,  shown  in  figure  B-4, 
have  a  number  of  interesting  interpretations.  First,  as  P  is  increased,  the  performance  gain  can  be 
viewed  as  the  result  of  increasing  the  total  target  receive  signal-to-noise  ratio  under  relative  to 
that  under  “Hi-  Second,  assume  that  y  =  y,  that  is,  assume  that  hypothesis  2  represents 
reverberation  only;  all  K  cells  under  ^  have  a  receive  signal-to-noise  ratio  of  y.  Although  K  -  J 
reverberation  cells  do  not  affect  performance,  J  of  the  cells  now  do.  As  P  is  increased,  the 
performance  gain  can  be  viewed  as  increasing  the  total  target  receive  signal-to-noise  ratio  under  7{\ 
relative  to  the  total  receive  signal-to-noise  ratio  of  J  of  the  reverberation  cells  under  #2.  Third, 
assume  that  P  =  y,  that  is,  assume  that  hypothesis  1  represents  reverberation  only.  Recall  that 
P  >  y,  so  that  in  this  interpretation  greater  performance  gains  are  made  as  the  total  receive  signal- 
to-noise  ratio  of  the  reverberation  increases  beyond  that  of  the  target.  This  interpretation  could 
have  relevance  to  target  strength  reduction  efforts. 
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As  in  example  1,  each  target  comprises  scatterers  of  constant  strength.  Under  hypothesis  1, 
the  target  consists  of  M  cells.  Under  hypothesis  2,  the  target  comprises  J  >  M  cells,  the  same  M 
cells  as  under  hypothesis  1,  and  J  -  M  additional  cells  (see  figure  B-lb  for  an  example  of  the  test 
volume  geometry).  In  addition,  it  is  assumed  that  the  total  target  scattering  strength  under  each 
hypothesis  is  equal  (M(3  =  J\j/).  Because  of  the  unequal  number  of  target  scatterers,  a  number 
only  J  -  M  of  reverberation  scatterers  must  be  accounted  for.  The  results  are  as  follows: 
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ccid=—  =k,  k  =  1,2 . M, 

y+1 


aid  =  ^  s  X  ,  k  =  M+l,M  +  2 . J, 

\|/+1 


and 


«k2  =  ~~  sv»  k  =  1,  2,...,M, 


ak2  =  k  =  M  +  l,  M  +  2,...,J. 


(B-19a) 

(B-19b) 


(B- 19c) 

(B-19d) 


Since  y  <  V  <  (J,  both  X  and  |i  are  negative  resuiting  in  L  (under  each  hypothesis)  being 
equivalent  to  the  difference  of  two  independent  gamma-distnOuted  random  variables.  Therefore, 


*'-2M  LM-'  y  (  M-')  -  r<J>  exp  (-  .  L>0. 

T(M)(k+^)j-m  V  y  T(J-M)  (tcU)mLm  I  k  , 


p(Ll^i)  = 


J-M-l 


A'J  (-L)J-M_I  \  ,^/J-M-l\  r(M+m)ig"^ _ /Ln  L<0>  (B-20a) 


- - — - -  y  (-l)m(  „  i - exp(- 

T(J-M)  0C+\)M  n^0  V  m  J  T(M)(K+X)™Lm  {X 
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and 


Mum  = - 1  V  (  m-1)  _nj)y^m 
r(Mxv+n)J-M  t v  m  '  ra-i 


exp 


r(J-M)  (v+|i)mLm  ^  v 


L  >  0, 


^M-1  |-L|1*'  'y‘(  nM+nQv'V"  /LV 

r(j-M)(v  +  n)M  „ (To  ^  m  '  r(M)(v+^)mLm 


L  <  0,  (B-20b) 


where  TO  is  the  gamma  function. 


This  yields  the  following  expressions  for  classifier  performance: 


p  _  y  KJ~Mr(J-M+m)  X™ 

11  ~  rJTo  (K+A.)J-Mr(J-M)m!  (K+X)m 


yl  KJ-Mr(J-M-Hm) 

rn=0  (K+A.)J~Mr(J-M)m!  (K+X)m 


y(M-m,Ti/ic), 


H  ^  o. 


rnCTl)-^"1  KJ~Mf(J~M+m) 

11  t^o  (K+X)J-Mr(J-M)m!  (K+\)m 


+  ]  y-1  XMr~(M+m) _ 1  icJ-M 

rio  (K+A),v,r(M)m!  (J-M-l-m)!  (k+^)J-m 


7( J— M— M- .r|/X ) ,ri  <  0. 
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while 
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r,  v  vJ-Mr(J-M+m)  jim 

P12UU  =  2-i  - - ~777Z - 

m=0  (v+ji)J_Mr (J-M)m !  (v+ji)m 


M-l 


- 1 

m=0 


VJ~Mr(J-M+m)  jim 

- : - — -  y(M-m,r|/v), 

(v+|i)J-Mr(J-M)m!  (v+ji)m 


ri>0. 


and 

vJ~Mr(J-M+m)  jim 

Pi2(ti)=  2-  - — - - - - — 

m=o  (v+ji)J-Mr(J-M)m!  (v+p.)m 


+  J  y  1  |J.Mr(M+m) _ 1 _ vJ~M 

m=o  (v+|i)Mr(M)m!  (J-M-l-tn)!  (v+^)J-M 


T1  <0, 


(B-21b) 


y 

where  y(a,y)  =  |  exp(-t)  t a_1  dt  is  the  incomplete  gamma  function. 
0 


Figures  B-5  through  B-8  plot  the  performance  curves  for  representative  values  of  (3,  \| r,  y,  M, 
and  J.  In  figure  B-5,  all  parameters  except  7  (reverberation)  are  held  constant.  In  this  example,  y 
runs  from  a  minimum  value  of  zero  to  its  limiting  value  of  y.  As  expected,  classifier  performance 
degrades  with  increased  reverberation. 

In  figure  B-6,  reverberation  is  set  to  zero  (7  =  0),  M  and  (3  are  held  constant,  and  J  is 
increased  (thus  V  decreases).  As  in  case  1  (figure  B-2),  the  diversity  phenomenon  is  evident.  For 
a  given  P^,  there  is  a  particular  set  of  targets  taken  from  the  two  classes  characterized  by  constant 
("and  now  equal)  receive  signal-to-noise  ratios  (M(3  or  J\y)  for  which  the  spreading  of  this  received 
energy  over  some  V  channels  provides  for  best  performance. 


In  figure  B-7,  reverberation  is  set  equal  to  a  constant  value  (y  =  y  /4),  M  and  (3  are  held 
constant,  and  J  is  increased  (these  parameters  are  comparable  to  those  in  figure  B-6).  When 
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compared  with  figure  B-6,  it  can  be  seen  that  for  a  given  P12  optimum  performance  can  shift  to  a 
new  set  of  targets  within  the  class. 

In  figure  B-8,  reverberation  is  set  to  its  limiting  value  (y=  y),  J  and  \\f  are  held  constant,  and 
M  is  increased  (thus  (3  decreases).  In  this  example,  hypothesis  2  corresponds  to  reverberation, 
while  the  target  in  hypothesis  1  spreads  its  fixed  receive  energy  over  an  increasing  number  of  cells. 
The  diversity  effect  implies  that,  for  a  given  P^,  there  is  particular  target  (from  among  the  class 
characterized  by  constant  receive  signal-to-noise  ratio)  that  is  most  easily  detected  in  the 
reverberation. 


SUMMARY 

Performance  measures  for  the  binary  classification  problem  have  been  addressed.  The 
sufficient  statistic  was  shown  to  be  a  Hermitian  form  in  the  Gaussian  vector  y.  A  method  for 
determining  the  necessary  probability  density  functions  via  characteristic  functions  was  given.  In 
general,  the  determination  of  the  characteristic  functions  requires  a  numerical  approach;  therefore, 
certain  assumptions  were  made  in  order  to  present  some  general  results.  A  number  of  examples 
were  presented  for  the  case  of  a  sonar  system  capable  of  infinite  resolution  and  targets 
characterized  by  independent  scatterers.  In  many  cases  diversity  was  present.  That  is,  for  a  given 
probability  of  misclassification,  there  was  a  best  partitioning  of  the  fixed  total  receive  energy  of  a 
target  among  the  cells  in  the  test  volume. 

A  treatment  of  the  general  problem  (finite  resolution,  correlated  scatterers)  will  provide 
further  in'ight  into  the  performance  capability  of  the  optimum  processor.  This  is  the  subject  of  a 
relatec  ..ivestigation,  the  results  of  which  will  be  reported  in  the  near  future. 
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